(57) Abstract

An isolated and purified DNA molecule encoding hyaluronan
synthase-2 (Has2) is provided, as is purified and isolated
Has2 polypeptide. Also provided is an isolated and purified DNA
molecule encoding hyaluronan synthase-3 (Has3), as is purified
and isolated Has3 polypeptide.

GENE ENCODING HYALURONAN SYNTHASE

Background of the Invention 

Hyaluronan (HA, hyaluronic acid) is a linear unbranched polymer made
up of repeating disaccharide units of D-glucuronic acid (β1→3)
N-acetylglucosamine (β1→4). HA biosynthesis requires two enzyme
activities: the transfer of N-acetylglucosamine and glucuronic acid from
UDP-N-acetylglucosamine (UDP-GlcNAc) and UDP-glucuronic acid
(UDP-GlcUA), respectively, to the growing HA chain. HA is synthesized at the
inner face of the plasma membrane and is subsequently extruded to the outside
of the cell. HA is a major constituent of the extracellular matrix during
embryonic development. For example, within the developing embryo, HA
accumulates at sites of cell migration and proliferation, and has been proposed to
play important roles in craniofacial, limb, neural tube, and heart development. In
particular, HA is essential for the formation of endocardial cushions, the
structures required for septation and the development of heart valves. In adults,
HA is a major constituent of the extracellular matrix of most tissues and organs,
and a critical component of the vitreous humor of the eye, joint fluid and
cartilage.

HA is highly biocompatible and completely biodegradable, and has
demonstrated beneficial effects when administered to the joints of arthritic race
horses and to perforated rat tympanic membranes. HA has also been employed
to protect eye tissue during artificial intraocular lens implantations, as a delivery
agent for drugs and to prevent post-operative scarring.

Genes which encode HA biosynthetic enzymes have been identified in
bacteria, e.g., Group A Streptococcus (Wessels et al., Infect. Immun., 62, 433
(1994); DeAngelis et al., J. Biol. Chem., 268, 19181 (1993); DeAngelis et al.,
Biochemistry, 33, 9033 (1994)). Polymerization of HA by S. pyogenes occurs
through the action of a single enzyme, HA synthase, encoded by the hasA gene.
The S. pyogenes HA synthase is localized to the membrane and is predicted to
have several transmembrane domains and a large intracellular loop
encompassing the active site of the enzyme. Purified immobilized HasA has
been shown to be sufficient for HA polymerization in vitro (DeAngelis et al.,
Biochemistry, 33, 9033 (1994)). The transfer of the hasA gene and a second




encodes a UDP-glucose dehydrogenase, which converts UDP-glucose to
UDP-glucuronic acid (UDP-GlcUA), a subunit of HA.



However, there is evidence that other genes are also involved in bacterial
HA biosynthesis. A protein originally identified in Streptococcus equisimilis as
HA synthase (Lansing et al., Biochem. J., 289, 179 (1993)) has no sequence
similarity to S. pyogenes HasA but has significant sequence similarity to
bacterial proteins involved in oligopeptide binding and transport. Although the
total amount of HA synthesized by bacterial cells overexpressing the S.
equisimilis HA synthase increased, the length of the resultant HA chains was
significantly shorter, suggesting that the increase may be a function of an
elevation in the rate of HA transport from the cell (O'Regan et al., Int. J. Biol.
Macromol., 16, 283 (1994)). Thus, rather than being directly involved in HA
biosynthesis, the S. equisimilis HA synthase may be involved in the transport of
HA, or may participate in HA synthesis as an accessory molecule, rather than as
the synthase itself.

While both bacterial and animal sources of HA exist, high molecular
weight HA is difficult and costly to isolate and purify because HA is
complexed with proteoglycans. Moreover, both bacterial and animal sources of
HA are increasingly under more stringent regulatory controls due to fear of
contamination with identifiable, or as yet unidentified, infectious or toxic agents.
Furthermore, the extensive purification process of HA polymer from cells results
in an HA polymer of considerable molecular weight polydispersity.

Thus, there is a need to isolate and purify genes that encode eukaryotic
HA biosynthetic enzymes or proteins associated with the extracellular
accumulation of HA.

Summary of the Invention 

The present invention provides an isolated and purified DNA molecule
comprising a preselected DNA segment encoding eukaryotic, preferably
mammalian, hyaluronan synthase-2 (Has2), a biologically active variant thereof
or a biologically active subunit thereof. A preferred embodiment of the
invention is a DNA molecule comprising a preselected DNA segment, e.g., SEQ
ID NO:1, that encodes murine hyaluronan synthase-2. A murine hyaluronan
synthase-2 having SEQ ID NO:2 has 21% identity and 28% similarity to
Streptococcal HasA, and 55% identity and 73% similarity to murine Has1 (Itano
et al., J. Biol. Chem., 271, 9875 (1996); SEQ ID NO:3). Because the deduced
amino acid sequence of Has1 is distinct from the murine hyaluronan synthase-2
having SEQ ID NO:2, there appears to be more than one mammalian gene
encoding an enzyme or protein which is associated with HA biosynthesis and/or
extracellular HA accumulation. Another preferred embodiment of the invention
is a DNA molecule comprising a preselected DNA segment, e.g., SEQ ID
NO:23, that encodes human hyaluronan synthase-2, a polypeptide which does
not have amino acid sequence identity with the human homolog of murine Has1
(Itano et al., BBRC, 222, 816 (1996); SEQ ID NO:55). Also provided is an
isolated and purified DNA molecule comprising a preselected DNA segment
which encodes a protein that increases the amount of extracellular hyaluronan
produced by cultured primate cells transformed so as to express said DNA
segment.

Further provided is an isolated and purified DNA molecule comprising a
DNA segment encoding eukaryotic, preferably mammalian, hyaluronan
synthase-3 (Has3), or a biologically active variant thereof or a biologically active
subunit thereof. A preferred embodiment of the invention includes a preselected
DNA segment comprising SEQ ID NO:31 which encodes a hyaluronan
synthase-3 comprising SEQ ID NO:32. Another preferred embodiment of the
invention includes a DNA molecule comprising a preselected DNA segment
comprising SEQ ID NO:25 which encodes a hyaluronan synthase-3 comprising
SEQ ID NO:29. The DNA molecules of the invention are double-stranded or
single-stranded; preferably, they are cDNA.



An isolated and purified DNA molecule, such as a probe or a primer, e.g.,




which hybridizes under stringent conditions to the DNA molecules of the
invention, or RNA molecules derived from these DNA molecules, is also
provided by the invention. The term "stringent conditions" is defined
hereinbelow. The probes or primers of the invention have at least about 80%,
preferably at least about 90%, identity to the above-disclosed DNA sequences, or
sequences complementary thereto. A preferred embodiment of the invention
includes a probe or primer which has at least about 80%, preferably at least about
90%, more preferably at least about 95%, identity to 1) SEQ ID NO:1, 2) SEQ
ID NO:23, 3) SEQ ID NO:25, 4) SEQ ID NO:26 or 5) SEQ ID NO:31, or a
sequence complementary thereto. The probes or primers of the invention may be
detectably labeled or have a binding site for a detectable label. The probes or
primers are useful to detect, quantify and/or amplify DNA strands with
complementarity to sequences related to hyaluronan synthase-2 or hyaluronan
synthase-3 in eukaryotic tissue samples. The probes and primers of the present
invention are also useful for detecting RNA molecules resulting from
transcription of the DNA molecules of the present invention. The uses of probes
and primers, as well as their isolation, purification and conditions under which
they are employed for the detection or amplification of a specific gene, are well
known in the art.

The present invention also provides isolated and purified DNA molecules
which provide "anti-sense" mRNA transcripts of the DNA sequences, including
SEQ ID NO:1 or SEQ ID NO:31, which, when expressed from an expression
cassette in a host cell, can alter HA expression.
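
The relationship between a coding (sense) DNA sequence and its anti-sense transcript can be illustrated with a minimal Python sketch. The function names and the six-base example sequence are hypothetical and are not taken from SEQ ID NO:1 or SEQ ID NO:31; the sketch assumes an unambiguous A/C/G/T alphabet.

```python
# Minimal sketch: derive the anti-sense RNA for a sense-strand DNA sequence.
# Function names and the example sequence are illustrative assumptions only.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(sense_dna: str) -> str:
    """Return the anti-sense RNA, written 5'->3', for a sense-strand DNA."""
    return reverse_complement(sense_dna).upper().replace("T", "U")


if __name__ == "__main__":
    print(antisense_rna("ATGGCC"))
```

An anti-sense transcript produced in this way is complementary to the mRNA transcribed from the sense strand, which is the property relied upon when such transcripts are used to alter expression.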

The present invention also provides an expression cassette comprising a
promoter which is functional in a host cell operably linked to a preselected DNA
segment encoding hyaluronan synthase-2. Preferably, the expression cassette
comprises a preselected DNA segment encoding murine hyaluronan synthase-2.
Another preferred embodiment of the invention is an expression cassette
comprising a preselected DNA segment encoding human hyaluronan synthase-2.

The present invention further provides an expression cassette comprising
a promoter which is functional in a host cell operably linked to a preselected
DNA segment encoding hyaluronan synthase-3. Preferably, the expression
cassette comprises a preselected DNA segment encoding murine hyaluronan
synthase-3. Another preferred embodiment of the invention is an expression
cassette comprising a preselected DNA segment encoding human hyaluronan
synthase-3. Such expression cassettes can be placed into expression vectors
which can then be employed to transform prokaryotic or eukaryotic host cells. It
is envisioned that the vectors of the invention may be useful to transform
mammalian cells in vivo, or in vitro with subsequent introduction of the
transformed cells to a host organism. The in vivo delivery of the vectors may be
accomplished by methods well known to the art, including, but not limited to,
viral- or liposome-mediated delivery. The present cassettes can also contain a
functional DNA sequence which is a selectable marker gene or reporter gene, as
described below.

Also provided is a transformed host cell, the genome of which has been
augmented by a preselected DNA sequence encoding hyaluronan synthase-2, a
preselected DNA sequence encoding hyaluronan synthase-3, or a combination
thereof. Preferably, the preselected DNA sequence is integrated into the
chromosome of the transformed host cell, and is heritable.

Expression of mouse hyaluronan synthase-2 or mouse hyaluronan
synthase-3 in COS-1 cultured primate cells results in the formation of large
well-pronounced HA coats, as described hereinbelow. Moreover, HA coat formation
in COS cells transfected with an hyaluronan synthase-2 expression vector
occurred in the absence of HA receptor expression, exogenously added HA, or
proteoglycans. This suggests that hyaluronan synthase-2 expression leads to the
synthesis of HA, in a form which is extruded through the plasma membrane and
may associate with the cell surface to form an HA coat through continued
attachment to the HA synthase.

Further provided is isolated, purified hyaluronan synthase-2 polypeptide.
A preferred embodiment of the invention is isolated, purified murine hyaluronan
synthase-2 polypeptide. Another preferred embodiment of the invention is




Also provided is isolated, purified hyaluronan synthase-3 polypeptide. A
preferred embodiment of the invention is isolated, purified murine hyaluronan
synthase-3 polypeptide. Another preferred embodiment of the invention is
isolated, purified hyaluronan synthase-3 polypeptide having SEQ ID NO:32.

As used herein, the term "Has2" or "hyaluronan synthase-2" is preferably
defined to mean a polypeptide comprising SEQ ID NO:2, as well as variants of
SEQ ID NO:2 which have at least about 80%, preferably at least about 90%,
identity or homology to SEQ ID NO:2, or a biologically active subunit thereof.
Biologically active subunits of hyaluronan synthase-2, variant hyaluronan
synthase-2 polypeptides and biologically active subunits thereof, falling within
the scope of the invention have at least about 50%, preferably at least about 80%,
and more preferably at least about 90%, the activity of the hyaluronan synthase-2
polypeptide comprising SEQ ID NO:2. The activity of an hyaluronan synthase-2
polypeptide can be measured by methods well known to the art including, but
not limited to, the particle exclusion assay described hereinbelow, an
immunoassay which detects HA production, as described by Itano et al. (J. Biol.
Chem., 271, 9875 (1996)), HA synthase activity of crude membrane
preparations, as described by Itano et al. (supra), or HA synthase activity of cell
lysate preparations, as described by Meyer et al. (Proc. Natl. Acad. Sci. USA, 93,
4543 (1996)).

As used herein, the term "Has3" or "hyaluronan synthase-3" is preferably
defined to mean a polypeptide comprising SEQ ID NO:32, SEQ ID NO:29, or a
biologically active subunit thereof, as well as variants of SEQ ID NO:32 or SEQ
ID NO:29 and subunits thereof which have at least about 80%, preferably at least
about 90%, identity or homology to SEQ ID NO:32 or SEQ ID NO:29,
respectively. Biologically active subunits of hyaluronan synthase-3, variant
hyaluronan synthase-3 polypeptides and biologically active subunits thereof,
falling within the scope of the invention have at least about 50%, preferably at
least about 80%, and more preferably at least about 90%, the activity of the
hyaluronan synthase-3 polypeptide comprising SEQ ID NO:32 or SEQ ID
NO:29. The activity of an hyaluronan synthase-3 polypeptide can be measured
by the methods described above for hyaluronan synthase-2.

The present invention also provides a method to produce hyaluronan
synthase-2, comprising: culturing a host cell, preferably a primate host cell,
transformed with a nucleic acid molecule comprising a DNA segment encoding
hyaluronan synthase-2 operably linked to a promoter, so that said host cell
expresses said hyaluronan synthase-2. The method also preferably provides
isolated recombinant hyaluronan synthase-2 polypeptide which is recovered
from the transformed host cells.

Also provided is a method to produce hyaluronan synthase-3,
comprising: culturing a host cell transformed with a nucleic acid molecule
comprising a DNA segment encoding hyaluronan synthase-3 operably linked to
a promoter, so that said host cell expresses said hyaluronan synthase-3. The
method also preferably provides isolated recombinant hyaluronan synthase-3
polypeptide which is recovered from the transformed host cells. Optionally, host
cells can be co-transformed with a nucleic acid molecule comprising a DNA
segment encoding hyaluronan synthase-3 operably linked to a promoter and a
nucleic acid molecule comprising a DNA segment encoding hyaluronan
synthase-2 operably linked to a promoter.

Further provided is a method of altering the amount of hyaluronan
produced by a cell. The method comprises introducing into a host cell a
preselected DNA segment encoding hyaluronan synthase-2 operably linked to a
promoter so as to yield a transformed host cell. The preselected DNA segment is
expressed as hyaluronan synthase-2 in the transformed host cell in an amount
that results in the transformed host cell producing an altered, preferably
increased, amount of hyaluronan relative to the amount of hyaluronan produced
by a corresponding untransformed host cell.

Also provided is a method of altering the amount of hyaluronan produced
by a cell. The method comprises introducing into a host cell a preselected DNA
segment encoding hyaluronan synthase-3 operably linked to a promoter so as to
yield a transformed host cell. The preselected DNA segment is expressed as
hyaluronan synthase-3 in the transformed host cell in an amount that results in
the transformed host cell producing an altered, preferably increased, amount of
hyaluronan relative to the amount of hyaluronan produced by a corresponding
untransformed host cell.

Once isolated and purified, the genes involved in HA biosynthesis and
extracellular accumulation of HA can be employed to synthesize HA in vitro.
Because in vitro synthesized HA is of extremely high purity, is free from
bacterial and animal cell contaminants, and can be optimized as to its
physicochemical properties, it is a preferred source of HA relative to HA derived
from bacterial or animal sources. Thus, the invention provides a method to
prepare HA which comprises contacting an amount of hyaluronan synthase-2, an
amount of hyaluronan synthase-3, or a combination thereof, with a mixture of
components under conditions effective to yield hyaluronan.

Moreover, the identification of genes involved in HA biosynthesis and/or
coat formation may also be useful for defining the molecular basis for genetic
diseases which are associated with a deficiency in HA biosynthesis, such as
cartilage pathologies, for providing a clinically useful diagnostic test or in
molecular-based therapeutics. Furthermore, the cloning of these genes will help
to elucidate the molecular mechanism giving rise to the alteration of the protein
encoded by the gene in patients having a particular disorder, e.g., a cartilage
deficiency associated with reduced HA biosynthesis.

Thus, the invention provides a method to prevent or treat a condition
associated with an alteration in HA synthesis or extracellular accumulation. The
method comprises administering to a mammal afflicted with, or at risk of, said
condition an amount of mammalian hyaluronan synthase-2 effective to alter HA
synthesis or extracellular accumulation.

The invention also provides a method to prevent or treat a condition
associated with an alteration in HA synthesis or extracellular accumulation,
comprising: administering to a mammal afflicted with, or at risk of, said
condition an amount of mammalian hyaluronan synthase-3 effective to alter HA
synthesis or extracellular accumulation.

Also provided is a method to identify a mammal afflicted with, or at risk
of, a condition associated with aberrant HA synthesis or extracellular
accumulation. The method comprises contacting an agent that binds to
mammalian hyaluronan synthase-2 with a mammalian sample suspected of
containing hyaluronan synthase-2 so as to form a complex. Then the presence or
amount of complex formation is detected or determined and the presence or
amount of complex formation is correlated with the presence or absence of the
condition.

The invention also provides a method to identify a mammal afflicted
with, or at risk of, a condition associated with aberrant HA synthesis or
extracellular accumulation which employs an agent that binds to mammalian
hyaluronan synthase-3. The agent is contacted with a mammalian sample
suspected of containing hyaluronan synthase-3 so as to form a complex. The
presence or amount of complex formation is detected or determined and the
presence or amount of complex formation is correlated with the presence or
absence of the condition.

Further provided is a method for detecting hyaluronan synthase-2 DNA.
The method comprises contacting an amount of DNA obtained by reverse
transcription of RNA from a mammalian physiological sample which comprises
cells suspected of containing hyaluronan synthase-2 RNA, with an amount of at
least two oligonucleotides under conditions effective to amplify the DNA by a
polymerase chain reaction so as to yield an amount of amplified hyaluronan
synthase-2 DNA. At least one oligonucleotide is an hyaluronan
synthase-2-specific oligonucleotide. The presence or amount of the amplified
hyaluronan synthase-2 DNA is then detected.

The invention also provides a method for detecting hyaluronan
synthase-3 DNA. The method comprises contacting an amount of DNA obtained by
reverse transcription of RNA from a mammalian physiological sample which
comprises cells suspected of containing hyaluronan synthase-3 RNA, with an
amount of at least two oligonucleotides under conditions effective to amplify the
DNA by a polymerase chain reaction so as to yield an amount of amplified
hyaluronan synthase-3 DNA. At least one oligonucleotide is an hyaluronan
synthase-3-specific oligonucleotide. The presence or amount of the amplified
hyaluronan synthase-3 DNA is detected.

As used herein, the term "hyaluronan synthase-2-specific
oligonucleotide" or "hyaluronan synthase-3-specific oligonucleotide" means a
DNA sequence that has at least about 80%, preferably at least about 90%, and
more preferably at least about 95%, sequence identity with SEQ ID NO:1 or
SEQ ID NO:23 (has2), or SEQ ID NO:25, SEQ ID NO:29 or SEQ ID NO:32
(has3), respectively. An oligonucleotide or primer of the invention has at least
about 7-50, preferably about 10-40, and more preferably about 15-35,
nucleotides. Preferably, the oligonucleotide primers of the invention comprise at
least 7 nucleotides at their 3' end which have at least about 85% identity to SEQ
ID NO:1, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:29 or SEQ ID NO:32.
The oligonucleotides of the invention may also include sequences which are
unrelated to has sequences.
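
The length and identity thresholds recited above can be checked mechanically. The following Python sketch is illustrative only: it assumes that the primer has already been aligned, without gaps, to an equal-length region of the reference sequence on the same strand, and the function names, default thresholds as coded, and test sequences are hypothetical.

```python
# Minimal sketch of the recited oligonucleotide criteria: overall length,
# overall identity, and identity of the 3'-terminal nucleotides.
# Assumes an ungapped, same-length, same-strand alignment to the target.


def percent_identity(a: str, b: str) -> float:
    """Percent of matching positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def primer_meets_criteria(primer: str, target: str,
                          min_len: int = 7, max_len: int = 50,
                          overall: float = 80.0,
                          three_prime: float = 85.0,
                          tail: int = 7) -> bool:
    """Apply the length, overall-identity and 3'-end identity thresholds."""
    if not (min_len <= len(primer) <= max_len):
        return False
    if percent_identity(primer, target) < overall:
        return False
    # The last `tail` nucleotides form the 3' end of the primer.
    return percent_identity(primer[-tail:], target[-tail:]) >= three_prime
```

Under these assumptions, a window of 7 nucleotides at the 3' end tolerates at most one mismatch, because 6 of 7 matches is about 85.7% identity whereas 5 of 7 is about 71.4%.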

Further provided is a method for detecting a condition associated with
aberrant HA synthesis or extracellular accumulation. The method comprises
contacting an amount of DNA obtained by reverse transcription of RNA from a
mammalian physiological sample which comprises cells suspected of containing
hyaluronan synthase-2 RNA, with an amount of at least two oligonucleotides
under conditions effective to amplify the DNA by a polymerase chain reaction so
as to yield an amount of amplified hyaluronan synthase-2 DNA. Alternatively,
or concurrently, an amount of DNA obtained by reverse transcription of RNA
from a mammalian physiological sample which comprises cells suspected of
containing hyaluronan synthase-3 RNA is contacted with an amount of at least
two oligonucleotides under conditions effective to amplify the DNA by a
polymerase chain reaction so as to yield an amount of amplified hyaluronan
synthase-3 DNA. Then the presence or amount of the amplified hyaluronan
synthase-2 and/or hyaluronan synthase-3 DNA is detected. The presence or
amount of hyaluronan synthase-2 and/or hyaluronan synthase-3 DNA is
indicative of the presence of the condition in said mammal.

The invention also provides a therapeutic method in which an amount of
an agent that alters the activity of native hyaluronan synthase-2, native
hyaluronan synthase-3, or a combination thereof, is administered to a mammal.

Brief Description of the Figures

Figure 1. Degenerate RT-PCR analysis. An agarose gel is shown which
depicts polymerase chain reaction (PCR) amplified bands characteristic of a
typical RT-PCR experiment. RT-PCR was performed on total RNA isolated
from 10.5 days post coitum (dpc) (E10.5) and 14.5 dpc (E14.5) C57BL/6J
mouse embryos. M indicates 1 kilobase pair ladder (GIBCO-BRL/Life
Technologies, Gaithersburg, MD). DEG1/3 indicates degenerate primer pools 1
and 3. DEG1/5 indicates degenerate primer pools 1 and 5.

Figure 2. cDNA library clones. The extent of overlapping cDNA clones
is shown in relation to the mouse Has2 cDNA and to the degenerate RT-PCR
mouse Has2 cDNA clone, MHas300. The positions of the translation initiation
codon (ATG), the translation termination codon (TGA), and the internal EcoRI
restriction endonuclease site (E) are indicated.

Figure 3. Nucleotide sequence encoding, and corresponding amino acid
sequence of, mouse Has2 (SEQ ID NO:1 and SEQ ID NO:2, respectively). The
5' and 3' untranslated nucleotide sequences are shown in lowercase, whereas the
open reading frame is shown in uppercase. The stop codon, consensus
polyadenylation signals, CA repeat and TA repeat are underlined.

Figure 4. Alignment of mouse Has2 with mouse Has1 (Itano et al., J.
Biol. Chem., 271, 9875 (1996)) (SEQ ID NO:3), Xenopus laevis DG42 (SEQ ID
NO:4), Streptococcus pyogenes HasA (SEQ ID NO:5), and Rhizobium meliloti
NodC (SEQ ID NO:6). Identical residues are boxed. Dashes indicate gaps that
have been introduced to maximize the identity. Asterisks below the line indicate
positions at which there have been conservative amino acid substitutions.



Figure 5. Alignment of two regions of mouse Has2 (SEQ ID NOs:7 and




and SEQ ID NO:40), X. laevis DG42 (SEQ ID NO:10 and SEQ ID NO:42), S.
pyogenes HasA (SEQ ID NO:11 and SEQ ID NO:44), R. meliloti NodC (SEQ ID
NO:12 and SEQ ID NO:46) and S. cerevisiae chitin synthase 2 (Chs2) (SEQ ID
NO:13 and SEQ ID NO:45). Dashes represent gaps that have been introduced to
maximize homology. Residues highlighted in bold type are those that have been
demonstrated to be critical in terms of enzyme activity of Chs2 (see Nagahashi et
al., J. Biol. Chem., 270, 13961 (1995)) and that are conserved in all six
sequences.

Figure 6. Kyte-Doolittle hydrophilicity plots and linear cartoon
representation of mouse Has2 protein. A) Comparison of mouse Has2, mouse
Has1 and Streptococcus pyogenes HasA by Kyte-Doolittle hydrophilicity plots.
The amino acid sequences of mouse Has2, mouse HAS (Has1) and bacterial
HasA were analyzed using the Kyte-Doolittle algorithm (MacVector) with a
hydrophilicity window size of 15. Strongly hydrophobic areas of the proteins
are indicated below the axes. Areas predicted to be potential transmembrane
domains or signal peptide are indicated by the black bars below each plot. B)
Linear representation of mouse Has2 predicted protein. Hydrophobic areas are
indicated by the filled black boxes. Consensus B(X7)B HA binding motifs
(HABM) are indicated by the filled gray boxes and are numbered. These motifs
correspond to amino acid residues 100-108, 107-115, 420-428, and 460-468.
The predicted intracellular loop of the molecule is indicated.
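
For illustration, the sliding-window calculation underlying a Kyte-Doolittle plot with a window size of 15 can be sketched in Python as follows. This is not the MacVector implementation; the function names are hypothetical, and the sign convention (hydrophilicity taken as the negated hydropathy average, so that hydrophobic stretches fall below the axis) is an assumption made to match the plots described in the legend.

```python
# Minimal sketch of a Kyte-Doolittle sliding-window profile.
# Values are the published Kyte-Doolittle hydropathy indices per residue.

KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def kyte_doolittle(seq: str, window: int = 15) -> list:
    """Mean hydropathy for every full window along the sequence."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        return []
    scores = [KD[aa] for aa in seq]
    total = sum(scores[:window])
    profile = [total / window]
    # Slide the window one residue at a time, updating the running sum.
    for i in range(window, len(scores)):
        total += scores[i] - scores[i - window]
        profile.append(total / window)
    return profile


def hydrophilicity(seq: str, window: int = 15) -> list:
    """Negated hydropathy, so hydrophobic stretches plot below zero."""
    return [-v for v in kyte_doolittle(seq, window)]
```

A sequence of length n yields n - 14 values with a window of 15; runs of strongly negative hydrophilicity that are long enough to span a membrane are the usual candidates for transmembrane domains.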

Figure 7. Northern analyses of mouse Has2 expression. Multiple tissue
Northern blots of polyA+ RNA isolated from mouse embryos and adult tissues
were hybridized with a mouse Has2 ORF cDNA probe. The relative positions of

probe was employed as an internal control.

Figure 8. Southern analysis of mouse Has2. Total 129Sv/J mouse
genomic DNA was digested with the restriction enzymes, E (EcoRI), B
(BamHI), H (HindIII), and S (SacI) and probed with a labeled mouse Has2 ORF
cDNA. "M" indicates 1 kilobase pair ladder.

Figure 9. COS-1 cells expressing mouse Has2 hyaluronan coats. HA
coats were detected by a particle exclusion assay (see Clarris et al., Exp. Cell
Res., 49, 181 (1986)). (A) Mouse 3T6 embryonic fibroblasts. (B) COS-1 cells.
(C) COS-1 cells co-transfected with a β-gal expression vector and pCIneo
control vector. (D-I) COS-1 cells co-transfected with a vector which expresses
mouse Has2 and a vector which expresses β-gal. (E) Co-transfected COS-1 cells
which were maintained in starvation-medium. (F and I) Co-transfected COS-1
cells stained for β-gal activity. (H) Co-transfected COS-1 cells which were
maintained in starvation-medium containing hyaluronidase.

Figure 10. (A) Partial nucleotide sequence of human hyaluronan
synthase-2 (SEQ ID NO:23). (B) Nucleotide sequence alignment of human
hyaluronan synthase-2 (SEQ ID NO:23) and mouse hyaluronan synthase-2 (SEQ
ID NO:1). (C) Amino acid sequence alignment of human hyaluronan synthase-2
(SEQ ID NO:24) and mouse hyaluronan synthase-2 (SEQ ID NO:2).

Figure 11. (A) Partial nucleotide sequence of human hyaluronan
synthase-3 (SEQ ID NO:25). (B) Partial nucleotide sequence of murine
hyaluronan synthase-3 (SEQ ID NO:26). (C) Nucleotide sequence alignment of
human hyaluronan synthase-3 (SEQ ID NO:25) and mouse hyaluronan
synthase-3 (SEQ ID NO:26). (D) Amino acid sequence alignment of human hyaluronan
synthase-3 (SEQ ID NO:27) and mouse hyaluronan synthase-3 (SEQ ID NO:28).

Figure 12. (A) Amino acid sequence alignment of a partial sequence for
human hyaluronan synthase-3 (Has3) (SEQ ID NO:29) with the equivalent
sequence of mouse Has3 (SEQ ID NO:30). Conserved amino acids are
indicated by a dash (-). (B) Nucleotide (SEQ ID NO:31) and predicted amino
acid (SEQ ID NO:32) sequence of the Has3 open reading frame. Sequences
representing consensus HA binding motifs are underlined. The locations of three
introns within the gene are indicated by arrowheads. The first intron is located
immediately preceding the start codon (ATG).
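
Candidate B(X7)B motifs of the kind referred to in the legends can be located with a simple pattern scan. The Python sketch below rests on an assumption not stated in this document: that B denotes a basic residue (arginine or lysine) and that the seven intervening residues are non-acidic (neither aspartate nor glutamate). The function name and test sequences are hypothetical. A lookahead is used so that overlapping motifs, such as those at residues 100-108 and 107-115 of mouse Has2, would both be reported.

```python
# Minimal sketch: scan a protein sequence for candidate B(X7)B motifs.
# ASSUMPTION: B = R or K; each of the seven X residues is neither D nor E.
import re

_HABM = re.compile(r"(?=([RK][^DE]{7}[RK]))")


def find_habm(protein: str) -> list:
    """Return (start, end, motif) tuples using 1-based inclusive positions."""
    hits = []
    for m in _HABM.finditer(protein.upper()):
        start = m.start() + 1          # convert 0-based index to 1-based
        hits.append((start, start + 8, m.group(1)))
    return hits
```

Each hit spans nine residues, consistent with the nine-residue ranges given for the motifs in Figure 6.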

Figure 13. Northern blot depicting the expression of mouse Has3 at four
different stages of mouse embryonic development. A cDNA probe representing




mouse embryonic polyA+ RNAs (CLONTECH) under conditions recommended 
by the manufacturer. 



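Figure 14. (A) Amino acid sequence alignment of mouse Has3 (SEQ ID
NO:32) with mouse Has2 (Mhas2) (SEQ ID NO:2), mouse Has1 (Mhas1) (SEQ
ID NO:3), Xenopus laevis DG42 (DG42) (SEQ ID NO:4) and Streptococcus
pyogenes HasA (SEQ ID NO:5). Conserved residues are boxed. Gaps have
been introduced to maximize the alignment. Asterisks indicate positions at
which there have been significant conservative amino acid substitutions. (B)
Alignment of two regions of the mouse Has3 protein sequence (SEQ ID NO:35
and SEQ ID NO:36, respectively) with equivalent regions of related
glycosyltransferases including mouse Has2 (SEQ ID NO:7 and SEQ ID NO:2,
respectively), mouse Has1 (SEQ ID NO:9 and SEQ ID NO:40, respectively),
Xenopus DG42 (SEQ ID NO:10 and SEQ ID NO:42, respectively), S. pyogenes
HasA (SEQ ID NO:11 and SEQ ID NO:44, respectively), Rhizobium meliloti
NodC (SEQ ID NO:12 and SEQ ID NO:46, respectively), Gossypium hirsutum
putative cellulose synthase A1 (celA1) (SEQ ID NO:47 and SEQ ID NO:48,
respectively) and Saccharomyces cerevisiae chitin synthase 2 (Chs2) (SEQ ID
NO:15 and SEQ ID NO:45, respectively). Site-directed mutagenesis of the
residues highlighted in bold of yeast Chs2 resulted in loss of enzymatic activity
(Nagahashi et al., J. Biol. Chem., 270, 13961 (1995)), suggesting that these
residues may be critical for β1→4 glycosyltransferase activity. (C) Kyte-Doolittle
hydrophilicity plots of mouse Has3, mouse Has2, mouse Has1 and S. pyogenes
HasA. Hydrophobic areas are represented below the axes. Potential
transmembrane domains are indicated by black bars drawn below each plot.

Figure 15. COS-1 cells expressing mouse Has3 hyaluronan coats. HA
coats were detected as described in the legend to Figure 9. (A) COS-1 cells
co-transfected with a β-gal expression vector and a vector which expresses mouse
Has2. (B) COS-1 cells co-transfected with a β-gal expression vector and pCIneo
control vector. (C) COS-1 cells co-transfected with a vector which expresses
mouse Has3 and a vector which expresses β-gal before mock treatment with
hyaluronidase. (D) COS-1 cells co-transfected with a vector which expresses
mouse Has3 and a vector which expresses β-gal after mock treatment with
hyaluronidase. (E) COS-1 cells co-transfected with a vector which expresses
mouse Has3 and a vector which expresses β-gal before treatment with
hyaluronidase. (F) COS-1 cells co-transfected with a vector which expresses
mouse Has3 and a vector which expresses β-gal after treatment with
hyaluronidase.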
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Figure 15. COS-1 cells expressing mouse Has 3 hyaluronan coats. HA 
coats were detected as described in the legend to Figure 9. (A) COS- 1 cells co- 
transfected with a P-gal expression vector and a vector which expresses mouse 
Has2. (B) COS-1 cells co-transfected with a p-gal expression vector and pClneo 
5 control vector. (C) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal before mock treatment with 
hyaluronidase. (D) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses p-gal after mock treatment with 
hyaluronidase. (E) COS-1 cells co-transfected with a vector which expresses 
10 mouse Has3 and a vector which expresses P-gal before treatment with 

hyaluronidase. (F) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal after treatment with 
hyaluronidase. 

Detailed Description of the Invention 

Definitions

"Southern analysis" or "Southern blotting" is a method by which the
presence of DNA sequences in a restriction endonuclease digest of DNA or
DNA-containing composition is confirmed by hybridization to a known, labeled
oligonucleotide or DNA fragment. Southern analysis typically involves
electrophoretic separation of DNA digests on agarose gels, denaturation of the
DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose,
nylon, or another suitable membrane support for analysis with a radiolabeled,
biotinylated, or enzyme-labeled probe as described in sections 9.37-9.52 of
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
(1989).

"Northern analysis" or "Northern blotting" is a method used to identify
RNA sequences that hybridize to a known probe such as an oligonucleotide,
DNA fragment, cDNA or fragment thereof, or RNA fragment. The probe is
labeled with a radioisotope such as 32P, by biotinylation or with an enzyme. The
RNA to be analyzed is usually electrophoretically separated on an agarose
or polyacrylamide gel, transferred to nitrocellulose, nylon, or other suitable
membrane, and hybridized with the probe, using standard techniques well known
in the art such as those described in sections 7.39-7.52 of Sambrook et al., supra.

"Polymerase chain reaction" or "PCR" refers to a procedure or technique
in which amounts of a preselected fragment of nucleic acid, RNA and/or DNA,
are amplified as described in U.S. Patent No. 4,683,195. Generally, sequence
information from the ends of the region of interest or beyond is employed to
design oligonucleotide primers. These primers will be identical or similar in
sequence to opposite strands of the template to be amplified. PCR can be used to
amplify specific RNA sequences, specific DNA sequences from total genomic
DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid
sequences, and the like. See generally Mullis et al., Cold Spring Harbor Symp.
Quant. Biol., 51, 263 (1987); Erlich, ed., PCR Technology (Stockton Press, NY,
1989).
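
By way of illustration only, the relationship between a primer pair and the
opposite strands of a template can be sketched in Python. The sketch is not
part of the disclosed methods. The function names, the default primer length,
and the choice to take both primers from the extreme ends of the supplied
sequence are assumptions made for the example; practical primer design also
weighs melting temperature, secondary structure and specificity.

```python
# Illustrative sketch: derive a primer pair from the two ends of a known
# template sequence. All names here are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primer_pair(template: str, primer_len: int = 20) -> tuple[str, str]:
    """Return (forward, reverse) primers for the supplied strand.

    The forward primer is identical to the 5' end of the supplied strand.
    The reverse primer is the reverse complement of its 3' end, i.e. it is
    identical to the 5' end of the opposite strand.
    """
    template = template.upper()
    if len(template) < 2 * primer_len:
        raise ValueError("template too short for two non-overlapping primers")
    forward = template[:primer_len]
    reverse = reverse_complement(template[-primer_len:])
    return forward, reverse
```

Calling design_primer_pair on a region of interest returns two
oligonucleotides that anneal to opposite strands and point toward each other,
which is the arrangement required for amplification of the intervening
sequence.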

As used herein, "stringent conditions" means conditions that detect a
nucleic acid molecule with at least 80%, preferably at least 90%, nucleotide
sequence homology to the probe or primer sequence. See Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press
(2nd ed., 1989) for selection of hybridization and washing conditions for
DNA:DNA, as well as DNA:RNA (Northern blot), stable and specific duplex
formation. Stringent conditions are those that (1) employ low ionic strength and
high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium
citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50°C, or (2) employ a
denaturing agent such as formamide during hybridization, e.g., 50% formamide
with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50
mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium
citrate at 42°C. Another example is use of 50% formamide, 5 x SSC (0.75 M
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1%
sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA
(50 μg/ml), 0.1% sodium dodecyl sulfate (SDS), and 10% dextran sulfate at
42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS.
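
The salt concentrations recited above are multiples of one stock formulation.
The 5 x SSC figures in the text (0.75 M NaCl, 0.075 M sodium citrate) imply
that 1 x SSC is 0.15 M NaCl and 0.015 M sodium citrate, so the 0.015 M
NaCl/0.0015 M sodium citrate wash corresponds to 0.1 x SSC. The following
Python sketch simply performs that arithmetic; the function and constant names
are hypothetical and are introduced only for this example.

```python
# Illustrative arithmetic only: convert between "n x SSC" and molarity.
# 1 x SSC is taken as 0.15 M NaCl / 0.015 M sodium citrate, as implied by
# the 5 x SSC values given in the text.
NACL_M_PER_X = 0.15
CITRATE_M_PER_X = 0.015


def ssc_molarity(fold: float) -> tuple[float, float]:
    """Return (NaCl molarity, sodium citrate molarity) for `fold` x SSC."""
    return (round(fold * NACL_M_PER_X, 6), round(fold * CITRATE_M_PER_X, 6))


def ssc_fold(nacl_molar: float) -> float:
    """Return the SSC multiple that corresponds to a given NaCl molarity."""
    return round(nacl_molar / NACL_M_PER_X, 6)
```

For instance, ssc_molarity(0.2) gives the salt content of the 0.2 x SSC wash
mentioned above, and ssc_fold(0.015) identifies the low ionic strength wash of
example (1) as 0.1 x SSC.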
Sources of Nucleic Acids Encoding Has2 or Has3

A mouse gene has been recently identified that encodes a putative HA
synthase, Has1 (Itano et al., J. Biol. Chem., 271, 9875 (1996)). However, the
results of a complementation analysis conducted by Itano et al. during the
isolation of the Has1 gene indicated that in the mouse, there are at least three
genes that are involved in HA biosynthesis. Sources of nucleotide sequences
from which these other genes, i.e., the present DNA molecules encoding Has2 or
Has3, can be derived include total or polyA+ RNA from eukaryotic, preferably
mammalian, embryonic cells, or mesothelioma and Wilms' tumors or cell lines
derived therefrom, as well as RNA isolated from embryonic tissue samples of
cartilage, heart, neural tube and the like. Other sources of the DNA molecules of
the invention include genomic DNA or cDNA libraries derived from any
eukaryotic source including other mammals, e.g., rat, bovine, equine and the
like, and other primates, e.g., humans and monkeys.

Isolation of a Gene Encoding Has2 or Has3

A nucleic acid molecule encoding mammalian HA biosynthetic enzymes,
such as Has2 or Has3, can be identified and isolated using standard methods, as
described by Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor, NY (1989). For example, degenerate reverse-transcriptase PCR
(RT-PCR) can be employed to isolate and clone Has2 or Has3 genes. This
approach relies upon conserved sequences deduced from alignments of related
gene or protein sequences. Sequence analysis of the hasA gene of S. pyogenes
predicts that the HA synthase is a membrane protein with a large intracellular
loop containing the active site of the enzyme (DeAngelis et al., J. Biol. Chem.,
268, supra). Similarly, in mammalian cells, the HA synthase has been localized
to the plasma membrane, with the active site on the inner face of the membrane
(Philipson et al., J. Biol. Chem., 259, 5017 (1984); Prehm, Biochem. J., 220, 597
(1984)). Moreover, database searches have identified the Rhizobium sp.
nodulation factor C (NodC) proteins, the Saccharomyces cerevisiae chitin
synthase 2 (Chs2) proteins, and the Xenopus laevis DG42 protein as sharing
sequence identity with HasA (DeAngelis et al., Biochem. Biophys. Res.
Commun., 199, 1 (1994)).

At least two degenerate primer pools for RT-PCR are prepared, one of
which is predicted to anneal to the antisense strand, and one of which is
predicted to anneal to the sense strand of a putative eukaryotic DNA molecule
[...] highly conserved regions of the proteins which were compared to generate the
primers.
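
As a hedged illustration of what a degenerate primer pool is, the Python
sketch below back-translates a short conserved peptide motif into a single
degenerate oligonucleotide written with IUPAC ambiguity codes and counts how
many distinct sequences the pool contains. It is not the procedure used to
design the primers of the invention. The function names are hypothetical, the
standard genetic code is assumed, and the position-by-position collapse used
here over-represents amino acids with six codons (Leu, Ser, Arg), for which a
real design would usually split the pool.

```python
# Illustrative sketch: back-translate a peptide motif into a degenerate
# oligonucleotide (sense orientation) and count the sequences in the pool.
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC ambiguity codes keyed by the sorted set of bases they represent.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
         "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N"}
CODE_SIZE = {code: len(bases) for bases, code in IUPAC.items()}


def degenerate_codon(amino_acid: str) -> str:
    """Collapse all codons for one amino acid into one IUPAC triplet."""
    codons = [c for c, aa in CODON_TABLE.items() if aa == amino_acid]
    if not codons:
        raise ValueError(f"unknown amino acid: {amino_acid!r}")
    return "".join(
        IUPAC["".join(sorted({c[i] for c in codons}))] for i in range(3)
    )


def degenerate_oligo(peptide: str) -> str:
    """Back-translate a peptide into a degenerate sense-strand oligo."""
    return "".join(degenerate_codon(aa) for aa in peptide.upper())


def pool_size(oligo: str) -> int:
    """Number of distinct sequences represented by a degenerate oligo."""
    return prod(CODE_SIZE[code] for code in oligo)
```

Motifs rich in Met and Trp, which have a single codon each, give small pools,
which is one reason such residues are favored when choosing conserved regions
for degenerate primers.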



One degenerate primer pool is then utilized for the first-strand synthesis.
RNA is isolated, e.g., using TRIZOL™ reagent (GIBCO-BRL/Life Technologies,
Gaithersburg, MD). Reverse transcription reactions are performed on a source of
nucleic acid believed to contain the DNA or RNA sequences of interest, e.g.,
total RNA isolated from mouse embryos.

Resultant first-strand cDNAs are then amplified in separate PCR
reactions. The products of each PCR reaction are separated on an agarose gel
and all consistently amplified products are gel-purified and cloned directly into a
suitable vector, such as a plasmid vector. The resultant plasmids are subjected to
restriction endonuclease analysis and dideoxy sequencing of double-stranded
plasmid DNAs.

Another approach to identify, isolate and clone genes which encode
mammalian HA biosynthetic enzymes is to screen a cDNA library generated
from embryonic heart or cartilage tissue. Screening for DNA fragments that
encode all or a portion of the gene encoding Has2 or Has3 can be accomplished
by probing the library with a probe which has sequences that are highly
conserved between genes believed to be related to Has2 or Has3, e.g., Has1,
HasA, DG42 or NodC, or by screening of plaques for binding to antibodies that
specifically recognize Has2 or Has3 related proteins. DNA fragments that bind
to a probe having sequences which are related to Has2 or Has3, or which are
immunoreactive with antibodies to Has2 or Has3 related proteins, can be
subcloned into a suitable vector and sequenced and/or used as probes to identify
other cDNA or genomic sequences encoding all or a portion of Has2 or Has3.

As used herein, the terms "isolated and/or purified" refer to in vitro
isolation of a DNA or polypeptide molecule from its natural cellular
environment, and from association with other components of the cell, such as
nucleic acid or protein, so that it can be sequenced, replicated, and/or expressed.
For example, "isolated Has2 nucleic acid" is RNA or DNA containing greater
than 7, preferably 15, and more preferably 20 or more, sequential nucleotide
bases that encode a biologically active Has2 polypeptide or a fragment thereof,
or a biologically active variant Has2 polypeptide or a fragment thereof, that is
complementary to the non-coding strand, or complementary to the coding strand,
of the native Has2 polypeptide RNA or DNA, or hybridizes to said RNA or
DNA and remains stably bound under stringent conditions.

"Isolated Has3 nucleic acid" is RNA or DNA containing greater than 7,
preferably 15, and more preferably 20 or more, sequential nucleotide bases that
encode a biologically active Has3 polypeptide or a fragment thereof, or a
biologically active variant Has3 polypeptide or a fragment thereof, that is
complementary to the non-coding strand, or complementary to the coding strand,
of the native Has3 polypeptide RNA or DNA, or hybridizes to said RNA or
DNA and remains stably bound under stringent conditions. Thus, the RNA or
DNA is "isolated" in that it is free from at least one contaminating nucleic acid
with which it is normally associated in the natural source of the RNA or DNA
and is preferably substantially free of any other mammalian RNA or DNA. The
phrase "free from at least one contaminating source nucleic acid with which it is
normally associated" includes the case where the nucleic acid is reintroduced
into the source or natural cell but is in a different chromosomal location or is
otherwise flanked by nucleic acid sequences not normally found in the source
cell. An example of isolated Has2 nucleic acid is RNA or DNA that encodes a
biologically active Has2 polypeptide sharing at least about 80%, preferably at
least about 90%, sequence identity with the Has2 polypeptide of Figure 3. An
example of isolated Has3 nucleic acid is RNA or DNA that encodes a
biologically active Has3 polypeptide sharing at least about 80%, preferably at
least about 90%, sequence identity with the Has3 polypeptide of Figure 12B.
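
The 80% and 90% sequence identity thresholds can be illustrated with a short
Python sketch that scores two sequences which have already been aligned. This
is an assumption-laden example rather than the method used to characterize the
sequences of the invention: the function names are hypothetical, the inputs
are toy strings, and the convention chosen here (columns containing a gap are
excluded from the count) is only one of several definitions of percent
identity in common use.

```python
# Illustrative sketch: percent identity between two pre-aligned sequences.
# Gap characters ("-") mark alignment gaps; gapped columns are skipped.
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return percent identity over the ungapped columns of an alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # column contains a gap in one of the sequences
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if the aligned pair shares at least `threshold` percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With the default threshold the second function applies the "at least about
80%" criterion; passing threshold=90.0 applies the preferred criterion.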
As used herein, the term "recombinant nucleic acid" or "preselected
nucleic acid," e.g., "recombinant DNA sequence or segment" or "preselected
DNA sequence or segment" refers to a nucleic acid, i.e., to DNA that has been
derived or isolated from any appropriate tissue source, that may be subsequently
chemically altered in vitro, so that its sequence is not naturally occurring, or
corresponds to naturally occurring sequences that are not positioned as they
would be positioned in a genome which has not been transformed with
exogenous DNA. An example of preselected DNA "derived" from a source
would be a DNA sequence that is identified as a useful fragment within a given
organism, and which is then chemically synthesized in essentially pure form. An
example of such DNA "isolated" from a source would be a useful DNA sequence
that is excised or removed from said source by chemical means, e.g., by the use
of restriction endonucleases, so that it can be further manipulated, e.g.,
amplified, for use in the invention, by the methodology of genetic engineering.

Thus, recovery or isolation of a given fragment of DNA from a restriction
digest can employ separation of the digest on polyacrylamide or agarose gel by
electrophoresis, identification of the fragment of interest by comparison of its
mobility versus that of marker DNA fragments of known molecular weight,
removal of the gel section containing the desired fragment, and separation of the
gel from DNA. See Lawn et al., Nucleic Acids Res., 9, 6103 (1981), and
Goeddel et al., Nucleic Acids Res., 8, 4057 (1980). Therefore, "preselected
DNA" includes completely synthetic DNA sequences, semi-synthetic DNA
sequences, DNA sequences isolated from biological sources, and DNA
sequences derived from RNA, as well as mixtures thereof.

As used herein, the term "derived" with respect to an RNA molecule
means that the RNA molecule has complementary sequence identity to a
particular DNA molecule.

Variants of the DNA Molecules of the Invention

Nucleic acid molecules encoding amino acid sequence variants of Has2
or Has3 are prepared by a variety of methods known in the art. These methods
include, but are not limited to, isolation from a natural source (in the case of
naturally occurring amino acid sequence variants) or preparation by
oligonucleotide-mediated (or site-directed) mutagenesis, PCR mutagenesis, and
cassette mutagenesis of a DNA molecule encoding an earlier prepared variant or
a non-variant version of Has2 or Has3 polypeptide.

Oligonucleotide-mediated mutagenesis is a preferred method for
preparing amino acid substitution variants of Has2 or Has3. This technique is
well known in the art as described by Adelman et al., DNA, 2, 183 (1983).
Briefly, Has2 or Has3 DNA is altered by hybridizing an oligonucleotide
encoding the desired mutation to a DNA template, where the template is the
single-stranded form of a plasmid or bacteriophage containing the unaltered or
native DNA sequence of Has2 or Has3. After hybridization, a DNA polymerase
is used to synthesize an entire second complementary strand of the template that
will thus incorporate the oligonucleotide primer, and will code for the selected
alteration in the Has2 or Has3 DNA.

Generally, oligonucleotides of at least 25 nucleotides in length are used.
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely
complementary to the template on either side of the nucleotide(s) coding for the
mutation. This ensures that the oligonucleotide will hybridize properly to the
single-stranded DNA template molecule. The oligonucleotides are readily
synthesized using techniques known in the art such as that described by Crea et
al., Proc. Natl. Acad. Sci. U.S.A., 75, 5765 (1978).
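
The length guidance above (12 to 15 matched nucleotides on either side of the
altered codon, at least 25 nucleotides overall) can be sketched in Python as
follows. The sketch is illustrative only: the function name and arguments are
hypothetical, the template is assumed to be supplied as an in-frame coding
strand, and whether the returned sequence or its reverse complement is
synthesized depends on which strand is packaged as the single-stranded
template.

```python
# Illustrative sketch: build a mutagenic oligonucleotide that replaces one
# codon and carries a fixed number of unaltered flanking nucleotides.
def mutagenic_oligo(template: str, codon_index: int, new_codon: str,
                    flank: int = 15) -> str:
    """Return the flank + new codon + flank sequence around one codon.

    template    -- coding-strand DNA, in frame from its first nucleotide
    codon_index -- zero-based index of the codon to replace
    new_codon   -- three-nucleotide replacement codon
    flank       -- matched nucleotides kept on each side (12 to 15 suggested)
    """
    template = template.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    start = 3 * codon_index
    end = start + 3
    if start - flank < 0 or end + flank > len(template):
        raise ValueError("not enough flanking sequence around the codon")
    return template[start - flank:start] + new_codon + template[end:end + flank]
```

With flank=12 the oligonucleotide is 27 nucleotides long and with flank=15 it
is 33 nucleotides long, so both ends of the suggested range satisfy the
minimum length of 25 nucleotides.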

The DNA template can be generated by those vectors that are either
derived from bacteriophage M13 vectors (the commercially available M13mp18
and M13mp19 vectors are suitable), or those vectors that contain a
single-stranded phage origin of replication as described by Viera et al., Meth.
Enzymol., 152, 3 (1987). Thus, the DNA that is to be mutated may be inserted
into one of these vectors to generate single-stranded template. Production of the
single-stranded template is described in Sections 4.21-4.41 of Sambrook et al.,
Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory
Press, N.Y. 1989).

Alternatively, single-stranded DNA template may be generated by
denaturing double-stranded plasmid (or other) DNA using standard techniques.

For alteration of the native DNA sequence (to generate amino acid
sequence variants, for example), the oligonucleotide is hybridized to the
single-stranded template under suitable hybridization conditions. A DNA
polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is then
added to synthesize the complementary strand of the template using the
oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed
such that one strand of DNA encodes the mutated form of the Has2 or Has3, and
the other strand (the original template) encodes the native, unaltered sequence of
the Has2 or Has3, respectively. This heteroduplex molecule is then transformed
into a suitable host cell, usually a prokaryote such as E. coli JM101. After the
cells are grown, they are plated onto agarose plates and screened using the
oligonucleotide primer radiolabeled with 32-phosphate to identify the bacterial
colonies that contain the mutated DNA. The mutated region is then removed and
placed in an appropriate vector for protein production, generally an expression
vector of the type typically employed for transformation of an appropriate host.

The method described immediately above may be modified such that a
homoduplex molecule is created wherein both strands of the plasmid contain the
mutation(s). The modifications are as follows: The single-stranded
oligonucleotide is annealed to the single-stranded template as described above.
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP),
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with
a modified thiodeoxyribocytosine called dCTP-(αS) (which can be obtained from
the Amersham Corporation). This mixture is added to the
template-oligonucleotide complex. Upon addition of DNA polymerase to this
mixture, a strand of DNA identical to the template except for the mutated bases is
generated. In addition, this new strand of DNA will contain dCTP-(αS) instead
of dCTP, which serves to protect it from restriction endonuclease digestion.

After the template strand of the double-stranded heteroduplex is nicked
with an appropriate restriction enzyme, the template strand can be digested with
ExoIII nuclease or another appropriate nuclease past the region that contains the
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that
is only partially single-stranded. A complete double-stranded DNA homoduplex
is then formed using DNA polymerase in the presence of all four
deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex
molecule can then be transformed into a suitable host cell such as E. coli JM101.

A preferred embodiment of the invention is an isolated and purified DNA
molecule comprising a preselected DNA segment encoding an Has2 polypeptide
having SEQ ID NO:2, wherein the DNA segment comprises SEQ ID NO:1, or
variants of SEQ ID NO:1 having nucleotide substitutions which are "silent."
That is, when nucleotide substitutions are present in a codon, the same amino
acid is encoded by the codon with the nucleotide substitution as is encoded by
the codon without the substitution. For example, leucine is encoded by the
codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:1 at the seventh
codon (CTA in SEQ ID NO:1) includes the substitution of CTT, CTC or CTG
for CTA. Other "silent" nucleotide substitutions in SEQ ID NO:1 which can
encode a polypeptide having SEQ ID NO:2 can be ascertained by reference to
page D1 in Appendix D in Sambrook et al., Molecular Cloning: A Laboratory
Manual (1989). Nucleotide substitutions can be introduced into DNA segments
by methods well known to the art. See, for example, Sambrook et al., supra.
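
The notion of a "silent" substitution can be checked mechanically against the
standard genetic code, as in the following Python sketch. The sketch is offered
only as an illustration: the function names are hypothetical, the standard
code is assumed, and the input sequences in any use of it are the reader's own.
Note that under the standard code leucine is also encoded by TTA and TTG, so
the sketch reports five synonymous alternatives to CTA rather than three.

```python
# Illustrative sketch: list synonymous codons and test whether a nucleotide
# change is "silent" under the standard genetic code.
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def synonymous_codons(codon: str) -> list[str]:
    """Return the other codons that encode the same amino acid as `codon`."""
    codon = codon.upper()
    amino_acid = CODON_TABLE[codon]
    return sorted(c for c, aa in CODON_TABLE.items()
                  if aa == amino_acid and c != codon)


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_silent(original: str, variant: str) -> bool:
    """True if the variant encodes the same polypeptide as the original."""
    return len(original) == len(variant) and translate(original) == translate(variant)
```

A variant coding sequence can thus be screened by comparing its translation
with that of the reference sequence before any mutagenesis is attempted.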
Another preferred embodiment of the invention is an isolated and
purified DNA molecule comprising a preselected DNA segment encoding an
Has3 polypeptide having SEQ ID NO:32, wherein the DNA segment comprises
SEQ ID NO:31, or variants of SEQ ID NO:31 having nucleotide substitutions
which are "silent." That is, when nucleotide substitutions are present in a codon,
the same amino acid is encoded by the codon with the nucleotide substitution as
is encoded by the codon without the substitution. For example, leucine is
encoded by the codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:31
at the fifth codon (CTG in SEQ ID NO:31) includes the substitution of CTT,
CTC or CTA for CTG. Other "silent" nucleotide substitutions in SEQ ID NO:31
which can encode a polypeptide having SEQ ID NO:32 can be ascertained by
reference to page D1 in Appendix D in Sambrook et al., Molecular Cloning: A
Laboratory Manual (1989). Nucleotide substitutions can be introduced into
DNA segments by methods well known to the art. See, for example, Sambrook
et al., supra.



As used herein, "chimeric" means that a vector comprises DNA from at
least two different species, or comprises DNA from the same species, which is
linked or associated in a manner which does not occur in the "native" or wild
type of the species.

The recombinant or preselected DNA sequence or segment, used for
transformation herein, may be circular or linear, double-stranded or
single-stranded. Generally, the preselected DNA sequence or segment is in the
form of chimeric DNA, such as plasmid DNA, that can also contain coding regions
flanked by control sequences which promote the expression of the preselected
DNA present in the resultant cell line. Aside from preselected DNA sequences
that serve as transcription units for Has2, Has3, or portions thereof, a portion of
the preselected DNA may be untranscribed, serving a regulatory or a structural
function. For example, the preselected DNA may itself comprise a promoter that
is active in mammalian cells, or may utilize a promoter already present in the
genome that is the transformation target. Such promoters include the CMV
promoter, as well as the SV40 late promoter and retroviral LTRs (long terminal
repeat elements), although many other promoter elements well known to the art
may be employed in the practice of the invention. A preferred promoter useful
in the practice of the invention is the CMV promoter.

Other elements functional in the host cells, such as introns, enhancers,
polyadenylation sequences and the like, may also be a part of the preselected
DNA. Such elements may or may not be necessary for the function of the DNA,
but may provide improved expression of the DNA by affecting transcription,
stability of the mRNA, or the like. Such elements may be included in the DNA
as desired to obtain the optimal performance of the transforming DNA in the
cell.

"Control sequences" is defined to mean DNA sequences necessary for the
expression of an operably linked coding sequence in a particular host organism.
The control sequences that are suitable for prokaryotic cells, for example,
include a promoter, and optionally an operator sequence, and a ribosome binding
site. Eukaryotic cells are known to utilize promoters, polyadenylation signals,
and enhancers.

"Operably linked" is defined to mean that the nucleic acids are placed in
a functional relationship with another nucleic acid sequence. For example, DNA
for a presequence or secretory leader is operably linked to DNA for a
polypeptide if it is expressed as a preprotein that participates in the secretion of
the polypeptide; a promoter or enhancer is operably linked to a coding sequence
if it affects the transcription of the sequence; or a ribosome binding site is
operably linked to a coding sequence if it is positioned so as to facilitate
translation. Generally, "operably linked" means that the DNA sequences being
linked are contiguous and, in the case of a secretory leader, contiguous and in
reading phase. However, enhancers do not have to be contiguous. Linking is
accomplished by ligation at convenient restriction sites. If such sites do not
exist, synthetic oligonucleotide adaptors or linkers are used in accord with
conventional practice.

The preselected DNA to be introduced into the cells further will generally
contain either a selectable marker gene or a reporter gene or both to facilitate
identification and selection of transformed cells from the population of cells
sought to be transformed. Alternatively, the selectable marker may be carried on
a separate piece of DNA and used in a co-transformation procedure. Both
selectable markers and reporter genes may be flanked with appropriate
regulatory sequences to enable expression in the host cells. Useful selectable
markers are well known in the art and include, for example, antibiotic and
herbicide-resistance genes, such as neo, hpt, dhfr, bar, aroA, dapA and the like.
See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No.
5,848,956).



Reporter genes are used for identifying potentially transformed cells and
for evaluating the functionality of regulatory sequences. Reporter genes which
encode easily assayable proteins are well known in the art. In general, a
reporter gene is a gene which is not present in or expressed by the recipient
organism or tissue and which encodes a protein whose expression is manifested
by some easily detectable property, e.g., enzymatic activity. Preferred genes
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coli, the
beta-glucuronidase gene (gus) of the uidA locus of E. coli, and the luciferase
gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at
a suitable time after the DNA has been introduced into the recipient cells.

The general methods for constructing recombinant DNA which can
transform target cells are well known to those skilled in the art, and the same
compositions and methods of construction may be utilized to produce the DNA
useful herein. For example, J. Sambrook et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press (2d ed., 1989),
provides suitable methods of construction.

Transformation into Host Cells

The recombinant DNA can be readily introduced into the host cells by
transfection with an expression vector comprising DNA encoding Has2, or an
expression vector comprising DNA encoding Has3, by any procedure useful for
the introduction into a particular cell, e.g., calcium phosphate precipitation,
lipofection, electroporation, and the like.

As used herein, the term "cell line" or "host cell" is intended to refer to
well-characterized homogenous, biologically pure populations of cells. These
cells may be eukaryotic cells that are neoplastic or which have been
"immortalized" in vitro by methods known in the art, as well as primary cells, or
prokaryotic cells. The cell line or host cell is preferably of mammalian origin,
but cell lines or host cells of non-mammalian origin may be employed, including
plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA
sequence is resident in the genome of the host cell but is not expressed, or not
highly expressed.

"Transfected" or "transformed" is used herein to include any host cell or
cell line, the genome of which has been altered or augmented by the presence of
at least one preselected DNA sequence, which DNA is also referred to in the art
of genetic engineering as "heterologous DNA," "recombinant DNA,"
"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA,"
wherein said DNA was isolated and introduced into the genome of the host cell
or cell line by the process of genetic engineering. The host cells of the present
invention are typically produced by transfection with a DNA sequence in a
plasmid expression vector, a viral expression vector, or as an isolated linear
DNA sequence. Preferably, the transfected DNA is a chromosomally integrated
recombinant DNA sequence, which comprises a gene encoding Has2, or which
comprises a gene encoding Has3, which host cell may or may not express
significant levels of autologous or "native" hyaluronan.

Has2 or Has3 Polypeptides

The present invention provides an isolated, purified Has2, or an isolated,
purified Has3, which can be prepared by recombinant DNA methodologies. The
general methods for isolating and purifying a recombinantly expressed protein
from a host cell are well known to those in the art. Examples of the isolation and
purification of such proteins are given in Sambrook et al., cited supra.
Moreover, since the present invention provides the complete amino acid
sequence of murine Has2 (Figure 3), and murine Has3 (Figure 12B), they or
bioactive variants thereof can also be synthesized by the solid phase peptide
synthetic method. This established and widely used method, including the
experimental procedures, is described in the following references: Stewart et al.,
Solid Phase Peptide Synthesis, W. H. Freeman Co., San Francisco (1969);
Merrifield, J. Am. Chem. Soc., 85, 2149 (1963); Meienhofer in "Hormonal
Proteins and Peptides," ed. C.H. Li, Vol. 2 (Academic Press, 1973), pp. 48-267;
and Barany and Merrifield, "The Peptides," eds. E. Gross and F. Meienhofer,
Vol. 2 (Academic Press, 1980) pp. 3-285.

When Has2 or Has3 polypeptide is expressed in a recombinant cell,
preferably a Has2- or Has3- cell, respectively, it is necessary to purify Has2 or
Has3 polypeptide from recombinant cell proteins or polypeptides to obtain
[...]
For example, the culture medium or lysate can be centrifuged to remove
particulate cell debris. The membrane and soluble protein fractions are then
separated. The Has3 polypeptide may then be purified from the soluble protein
fraction and, if necessary, from the membrane fraction of the culture lysate.
Has3 polypeptide can then be purified from contaminant soluble or membrane
proteins and polypeptides by fractionation on immunoaffinity or ion-exchange
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica
or on an anion-exchange resin such as DEAE; chromatofocusing; SDS-PAGE;
ammonium sulfate precipitation; gel filtration using, for example, Sephadex
G-75; or ligand affinity chromatography.

Has2 polypeptide, Has3 polypeptide, variant Has2 polypeptides, variant
Has3 polypeptides, or biologically active subunits thereof can also be prepared
by in vitro transcription and translation reactions. For example, a Has3
expression cassette can be employed to generate Has3 transcripts which are
subsequently translated in vitro so as to result in a preparation of substantially
homogenous Has3, variant Has3, or biologically active subunits thereof. The
construction of vectors for use in in vitro transcription/translation reactions, as
well as the methodologies for such reactions, are well known to the art.

Once isolated from the resulting transgenic host cells or from in vitro
transcription/translation reactions, derivatives and chemically derived variants of
the Has2 polypeptide or Has3 polypeptide can be readily prepared. For
example, amides of the Has3 polypeptides of the present invention may also be
prepared by techniques well known in the art for converting a carboxylic acid
group or precursor, to an amide. A preferred method for amide formation at the
C-terminal carboxyl group is to cleave the polypeptide from a solid support with
an appropriate amine, or to cleave in the presence of an alcohol, yielding an
ester, followed by aminolysis with the desired amine.

Salts of carboxyl groups of the Has2 polypeptide or Has3 polypeptide
may be prepared in the usual manner by contacting the peptide with one or more
equivalents of a desired base such as, for example, a metallic hydroxide base,
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for
example, sodium carbonate or sodium bicarbonate; or an amine base such as, for
example, triethylamine, triethanolamine, and the like.

N-acyl derivatives of an amino group of the present polypeptides may be
prepared by utilizing an N-acyl protected amino acid for the final condensation,
or by acylating a protected or unprotected peptide. O-acyl derivatives may be
prepared, for example, by acylation of a free hydroxy peptide or peptide resin.
Either acylation may be carried out using standard acylating reagents such as
acyl halides, anhydrides, acyl imidazoles, and the like. Both N- and O-acylation
may be carried out together, if desired. In addition, the internal Has2 or Has3
amino acid sequence of Figure 3 or Figure 12B, respectively, can be modified by
making one or two conservative amino acid substitutions at the positions
specified, including substitutions which utilize the D rather than L form. The
invention is also directed to variant or modified forms of the Has2 polypeptide or
Has3 polypeptide. One or more of the residues of the Has2 polypeptide can be
altered, so long as the variant polypeptide has at least about 50% of the
biological activity of the protein having SEQ ID NO:2. One or more of the
residues of the Has3 polypeptide can be altered, so long as the variant
polypeptide has at least about 50% of the biological activity of the protein
having SEQ ID NO:32. Conservative amino acid substitutions are preferred; that
is, for example, aspartic/glutamic as acidic amino acids; lysine/arginine/histidine
as basic amino acids; leucine/isoleucine, methionine/valine, alanine/valine as
hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino
acids.

Acid addition salts of the polypeptides may be prepared by contacting the
polypeptide with one or more equivalents of the desired inorganic or organic
acid, such as, for example, hydrochloric acid. Esters of carboxyl groups of the
polypeptides may also be prepared by any of the usual methods known in the art.

Has2 or Has3 Variant Polypeptides

It is envisioned that variant Has2 polypeptides have at least one amino

In particular, amino acids are substituted in a relatively conservative manner.
Such conservative substitutions are shown in Table 1 under the heading of
exemplary substitutions. More preferred substitutions are under the heading of
preferred substitutions. After the substitutions are introduced, the products are
screened for biological activity.

TABLE 1



Original     Exemplary                              Preferred
Residue      Substitutions                          Substitutions

Ala (A)      val; leu; ile                          val
Arg (R)      lys; gln; asn                          lys
Asn (N)      gln; his; lys; arg                     gln
Asp (D)      glu                                    glu
Cys (C)      ser                                    ser
Gln (Q)      asn                                    asn
Glu (E)      asp                                    asp
Gly (G)      pro                                    pro
His (H)      asn; gln; lys; arg                     arg
Ile (I)      leu; val; met; ala; phe; norleucine    leu
Leu (L)      norleucine; ile; val; met; ala; phe    ile
Lys (K)      arg; gln; asn                          arg
Met (M)      leu; phe; ile                          leu
Phe (F)      leu; val; ile; ala                     leu
Pro (P)      gly                                    gly
Ser (S)      thr                                    thr
Thr (T)      ser                                    ser
Trp (W)      tyr                                    tyr
Tyr (Y)      trp; phe; thr; ser                     phe
Val (V)      ile; leu; met; phe; ala; norleucine    leu

Amino acid substitutions falling within the scope of the invention are, in
general, accomplished by selecting substitutions that do not differ significantly
in their effect on maintaining (a) the structure of the polypeptide backbone in the
area of the substitution, for example, as a sheet or helical conformation, (b) the
charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the
side chain. Naturally occurring residues are divided into groups based on
common side-chain properties:

(1) hydrophobic: norleucine, met, ala, val, leu, ile;

(2) neutral hydrophilic: cys, ser, thr;

(3) acidic: asp, glu;

(4) basic: asn, gln, his, lys, arg;

(5) residues that influence chain orientation: gly, pro; and

(6) aromatic: trp, tyr, phe.
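
The six side-chain groups listed above can be turned into a simple lookup for
checking whether a proposed substitution stays within one group. The following
is a minimal sketch only: the function names are hypothetical, the grouping is
copied from the list above (including asn and gln under "basic"), and group
membership is used here as a rough stand-in for "conservative"; it does not
replace screening the products for biological activity.

```python
# Side-chain groups exactly as listed in the text. The grouping (including
# asn and gln under "basic") is taken from the source, not from a textbook.
SIDE_CHAIN_GROUPS = {
    "hydrophobic": {"norleucine", "met", "ala", "val", "leu", "ile"},
    "neutral hydrophilic": {"cys", "ser", "thr"},
    "acidic": {"asp", "glu"},
    "basic": {"asn", "gln", "his", "lys", "arg"},
    "chain orientation": {"gly", "pro"},
    "aromatic": {"trp", "tyr", "phe"},
}


def group_of(residue):
    """Return the name of the group containing `residue`, or None."""
    key = residue.lower()
    for name, members in SIDE_CHAIN_GROUPS.items():
        if key in members:
            return name
    return None


def is_same_group(original, replacement):
    """True when both residues fall in the same side-chain group.

    Hypothetical helper: same-group membership is only an approximation
    of a conservative substitution.
    """
    group = group_of(original)
    return group is not None and group == group_of(replacement)
```

For instance, is_same_group("Asp", "Glu") holds because both residues are in
the acidic group, whereas is_same_group("Lys", "Asp") does not.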

The invention also envisions Has2 or Has3 variants with non-conservative
substitutions. Non-conservative substitutions entail exchanging a




well known to the art. For example, see the description hereinabove for the 
introduction of silent mutations into the DNA molecules of the invention. 



Uses of Has2 or Has3 Genes and Polypeptides Thereof

The genes involved in HA biosynthesis and extracellular accumulation of
HA ("HA coat formation") can be employed to synthesize HA in vitro. Because
in vitro synthesized HA is of extremely high purity, is free from bacterial and
animal cell contaminants, and can be optimized as to its physicochemical
properties, it is preferred to HA derived by extraction from bacterial or animal
sources.

In vitro prepared HA has a range of applications similar to those
described above for HA which is derived from animal or bacterial cells, e.g.,
protecting eye tissue during artificial intraocular lens implantation, as a drug
delivery vehicle, and preventing or inhibiting post-operative adhesions. In vitro
synthesized HA may also be employed to enhance or promote wound healing or
tissue repair, e.g., to prevent restenosis following balloon angioplasty, and to
repair or replace damaged or absent cartilage present in congenital defects,
craniofacial disorders and arthritis. In addition, HA can be derivatized, as
described in Balazs et al. (Blood Coag. Fibrinolysis, 2, 173 (1991)), to provide
improved mechanical properties and an extended residence time in vivo.

Moreover, the identification of genes involved in HA biosynthesis and/or
coat formation may also be useful for defining the molecular basis for genetic
diseases, such as cartilage pathologies, e.g., rheumatoid arthritis, and for
providing a clinically useful diagnostic test or for use in molecular-based
therapeutics. Once such a gene has been identified, a probe specific for the gene
can be made. Patient DNA can be screened with the probe to detect particular
genetic variants that correlate with disease, e.g., craniofacial disorders. Patient
RNA can be incubated with the probe to determine if the gene is over- or
under-expressed in a patient with a particular disease relative to disease-free
patients.

Furthermore, the cloning of genes involved in HA biosynthesis and/or
extracellular coat formation will help to elucidate the molecular mechanism
giving rise to the alteration of the protein encoded by the gene, or its expression,
in patients having a particular disorder, e.g., cartilage deficiency. Once the
molecular mechanism underlying the expression of the gene is understood,
molecular genetic-based therapies directed to controlling the expression of the
gene can then be employed to correct or supplement the expression of the gene
in patients with the disorder.

For example, accelerated HA degradation accompanies osteoarthritis and
inflammatory arthritides. Thus, the administration of Has2 and/or Has3
polypeptide, expression vectors encoding Has2 and/or Has3 polypeptide or
agents that increase the expression or activity of native (i.e., endogenous) Has2
and/or Has3 may be efficacious for diseases which are characterized by
decreased levels of HA. Hyperthyroidism (Graves' disease) is associated with
excessive accumulation of HA in retro-orbital connective tissues, in the pretibial
area and elsewhere. In addition, various ill-characterized skin disorders or
mucinoses are also associated with accumulation of HA in the dermis. Thus, the
administration of agents that inhibit the expression or activity of native Has2
and/or Has3, or of expression vectors comprising has2 and/or has3 antisense
sequences, may be useful to prevent or treat these disorders.

In addition, high serum levels of HA are associated with rheumatoid arthritis,
septic conditions accompanying certain malignancies, e.g., mesothelioma and
Wilms' tumor, and edema due to inflammation in the lung and in kidneys
post-kidney transplantation. HA has also been implicated in Graves'
ophthalmopathy, cirrhosis of the liver and accelerated aging in Werner's
syndrome. Thus, the isolation of eukaryotic HA biosynthetic genes can be useful
in gene therapies which employ the cloned genes in antisense expression vectors
to inhibit or reduce the overexpression of HA genes in these patient populations.
For example, an expression vector containing antisense Has3 can be introduced
into joints (for rheumatoid arthritis), or into mesothelioma or Wilms' tumor
cells, to inhibit or reduce the overexpression of Has3.




Agents that increase or decrease native Has2 or Has3 activity or
expression may be identified using in vitro assays. For example, cells with low
basal Has2 or Has3 activity, such as Chinese Hamster Ovary (CHO) cells, are
stably transfected with recombinant plasmids that express Has2 and/or Has3.
The resulting cell lines are then contacted with an agent, and the amount of HA
synthesized or secreted, and the amount of HA coat formation, in the presence of
the agent relative to cells not exposed to the agent is determined using methods
described herein. To assess coat formation, a bead binding assay may be
employed. In this assay, polypeptide fragments with HA binding activity
(so-called HA binding domain or HABR) are covalently attached to micro-beads
tagged by fluorescent or other means (e.g., biotinylation). Agents that enhance
HA coat formation may be useful to decrease the adhesive properties of tissue,
e.g., mesothelial, surfaces.

Screening for agents that regulate Has2 and/or Has3 activity may also be
accomplished using an assay described in Spicer et al. (J. Biol. Chem., 272,
8957 (1997)). Radiolabeled UDP-sugar substrates (either
UDP-N-acetyl-D-glucosamine or UDP-D-glucuronate) in the presence of the other
required substrates are incubated with membrane extracts (10-25 mg protein) in
the presence or absence of the agent for 2 hours at 37°C. The radiolabeled
precursor molecules are then separated from the high molecular weight HA
product by paper chromatography and agarose gel electrophoresis. Paper
chromatography allows accurate quantification of enzyme activity, while agarose
gel electrophoresis allows rapid assessment of molecular mass. Filter assays
using precipitation with cetylpyridinium chloride or HPLC isolation of reductive
products of HA degradation by Streptomyces hyaluronidase may also be
employed. Direct interaction of an agent with Has2 and/or Has3 may be
determined by binding assays utilizing purified, recombinant Has2 and/or Has3
polypeptide present in liposomes or detergent micelles and labeled agent.
Agents that interact with highly conserved sequences present in enzymes
involved in synthesis of β1-4 linkages may be useful to inhibit native Has2
and/or Has3. S. cerevisiae chitin synthase 2 (Chs2) has two highly conserved
domains present in all chitin synthases that are critical to enzymatic activity and
speculated to be generally conserved in glycosyltransferases that catalyze the
synthesis of oligosaccharides with β1-4 linkages (Nagahashi et al., J. Biol.
Chem., 270, 13961 (1995)). Sequence alignments of Has1, mHas2, DG42,
HasA, NodC, and Chs2 revealed that several amino acid residues required for
catalytic activity of Chs2 are conserved in mHas2 and mHas3. In particular, the
second region of homology in Chs2 contains the highly conserved motif
NMYLA-EDRIL (residues 556-565; SEQ ID NO:56). Mutations at residue 562
in Chs2 resulted in complete loss of enzymatic activity. The similarity of mHas2
in this region (NQCSFGDDRH; SEQ ID NO:57) suggests that mutation of the
highly conserved D at position 314 may result in loss of enzymatic activity.
Expression of a mutant mHas2, having an amino acid substitution (D→A) at this
position, in COS-1 cells did not result in coat formation. Similarly, agents that
are ligand mimetics, e.g., 5-azido-UDP-glucuronic acid, may be tested for their
ability to alter Has2 and/or Has3 activity. Thus, agents that interact with
domains which comprise residues required for catalytic activity may be useful in
vivo inhibitors of Has2 and/or Has3 activity.
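
The correspondence between Chs2 residue 562 and mHas2 residue 314 can be
checked by numbering the two motifs column by column. The sketch below rests on
two assumptions that are not stated in the text: that the motifs are aligned
left to right with the single gap in the Chs2 motif as printed, and that the
mHas2 segment begins at residue 307 (chosen so that its second D is residue
314). The helper name is hypothetical.

```python
def aligned_residue_numbers(aligned_seq, first_residue_number):
    """Map each alignment column to a residue number (None for a gap)."""
    numbers = []
    current = first_residue_number
    for symbol in aligned_seq:
        if symbol == "-":
            numbers.append(None)      # gap: no residue in this column
        else:
            numbers.append(current)
            current += 1
    return numbers


CHS2_MOTIF = "NMYLA-EDRIL"   # Chs2 residues 556-565, gap as printed in the text
MHAS2_MOTIF = "NQCSFGDDRH"   # mHas2 segment quoted in the text
MHAS2_START = 307            # ASSUMPTION: makes the second D residue 314

chs2_numbers = aligned_residue_numbers(CHS2_MOTIF, 556)
mhas2_numbers = aligned_residue_numbers(MHAS2_MOTIF, MHAS2_START)

# Column holding the catalytically required Chs2 residue 562.
column = chs2_numbers.index(562)
chs2_residue = CHS2_MOTIF[column]
mhas2_residue = MHAS2_MOTIF[column]
mhas2_position = mhas2_numbers[column]
```

Under these assumptions the column containing Chs2 D562 also contains an
aspartate in mHas2, numbered 314, which is the residue replaced by alanine in
the mutant described above.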

Methods to Administer has2 or has3 Genes or Polypeptides to Tissue Surfaces

Delivery of has2 and/or has3 genes (e.g., in viral vectors or liposomes) or
purified Has2 and Has3 polypeptide (e.g., in liposomes) to tissue, e.g.,
mesothelial, surfaces provides an alternative approach to exogenous instillation
of HA containing solutions or HA containing films to coat opposing surfaces
with HA, to decrease adhesivity. To determine whether has genes or purified
Has polypeptide are useful to alter mesothelial HA synthesis or accumulation,
cultured mesothelial cells are transfected with has2 and/or has3 expression
vectors and/or contacted with purified Has2 and/or Has3 polypeptide. Sections
of serosa stripped off of the underlying mesothelial tissue may also be employed.
These sections are maintained in suspended well culture (e.g., Becton-Dickinson

culture medium of cultured cells or serosa, and secretion of HA analyzed by
removing the culture medium, and determining the incorporation of radiolabeled
precursor into a high molecular weight form (e.g., > 1 x 10^6 Daltons) which is
sensitive to degradation by Streptomyces hyalurolyticus. HA coat formation
can also be determined by fixation of the cells in the presence of
cetyltrimethylammonium bromide (CTAB), followed by immunohistochemical
staining with purified HA binding domain conjugated to biotin.

These in vitro tests can be extended to in vivo models in small animals
(e.g., rats, mice), in which viral vectors containing cDNAs encoding Has2 and/or
Has3, or purified, recombinant Has2 and/or Has3 polypeptide are introduced into
the peritoneal cavity. To assess optimal dosing, two approaches are envisioned.
First, to optimize the production of HA by the peritoneal surface, extensive
peritoneal lavage to remove free HA is performed. The HA can be quantified,
using methods outlined herein. Then, fixation in situ using CTAB containing
fixative, followed by staining for HA with biotinylated HA binding domain is
employed to show cell surface HA. Optimal dosages of viral vectors and/or
recombinant polypeptide depend upon the specific application (e.g., operative
site, specific surgery) and desired outcome (persistence of HA secretion and
anti-adhesive properties). The presence or amount of HA on mucosal or serosal
surfaces in vivo can be determined using labeled proteins containing HA binding
domains (Ripellino et al., J. Histochem. & Cytochem., 21, 1060 (1985);
Fenderson et al., Different., 54, 85 (1993)). Likewise, small molecules,
identified on the basis of their ability to stimulate or inhibit HA secretion in vitro
can be tested in similar models.

The invention will be further described by the following examples.

Example 1

cDNA Cloning and Characterization of Mouse Hyaluronan Synthase-2

The aligned amino acid sequences of HasA, DG42 and NodC were
utilized to prepare primers for a degenerate PCR strategy to identify a
HasA/DG42 related cDNA in the mouse. Three degenerate primer pools for
RT-PCR were prepared, two of which were predicted to anneal to the antisense
strand, and one of which was predicted to anneal to the sense strand of a putative
eukaryotic DNA molecule which encodes HA synthase. The oligonucleotides
were made corresponding to the peptide sequences AFNVERACQ (SEQ ID
NO:14), GDDRHLTN (SEQ ID NO:15), and QQTRWTKSYF (SEQ ID NO:16),
and had the following degenerate nucleotide sequences: DEG 1 primer, 5'-GCN
TTY AAY GTN GAR MGN GCN TGY CA-3' (SEQ ID NO:17, sense strand),
DEG 3 primer, 5'-RTT NGT NAR RTG NCK RTC RTC NCC-3' (SEQ ID
NO:18, antisense strand), and DEG 5 primer, 5'-RAA RTA NSW YTT NGT
CCA NCK NGT YTG YTG-3' (SEQ ID NO:19, antisense strand).
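
The relationship between the three peptide sequences and the degenerate primers
quoted above can be reproduced by back-translating each peptide into IUPAC
ambiguity codes and, for the antisense primers, taking the reverse complement.
The sketch below is illustrative only: the function names are hypothetical, the
codon table covers just the amino acids present in these peptides, and the
choice of MGN for Arg, YTN for Leu and WSN for Ser is inferred from the primer
sequences rather than stated in the text.

```python
# Degenerate codons (IUPAC ambiguity codes) for the amino acids that occur
# in the three peptides. Arg = MGN, Leu = YTN and Ser = WSN are inferred
# from the primers quoted in the text.
DEGENERATE_CODON = {
    "A": "GCN", "C": "TGY", "D": "GAY", "E": "GAR", "F": "TTY",
    "G": "GGN", "H": "CAY", "K": "AAR", "L": "YTN", "N": "AAY",
    "Q": "CAR", "R": "MGN", "S": "WSN", "T": "ACN", "V": "GTN",
    "W": "TGG", "Y": "TAY",
}

# Complement of each IUPAC code used above.
IUPAC_COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "M": "K", "K": "M",
    "S": "S", "W": "W", "N": "N",
}


def back_translate(peptide):
    """Back-translate a peptide into a degenerate sense-strand sequence."""
    return "".join(DEGENERATE_CODON[aa] for aa in peptide)


def reverse_complement(seq):
    """Reverse-complement a sequence written in IUPAC codes."""
    return "".join(IUPAC_COMPLEMENT[base] for base in reversed(seq))


# DEG 1 (sense): the wobble base of the final codon is omitted in the text.
deg1 = back_translate("AFNVERACQ")[:-1]
# DEG 3 and DEG 5 (antisense): reverse complement of the back-translation.
deg3 = reverse_complement(back_translate("GDDRHLTN"))
deg5 = reverse_complement(back_translate("QQTRWTKSYF"))
```

With these tables, deg1, deg3 and deg5 match the DEG 1, DEG 3 and DEG 5
sequences given above once the spaces between codons are removed.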

A degenerate primer pool made to the peptide sequence QQTRWTKSYF
(SEQ ID NO:16, DEG 5) was utilized for the first-strand synthesis. RNA was
isolated using TRIZOL™ reagent (GIBCO-BRL/Life Technologies,
Gaithersburg, MD) according to the manufacturer's directions. Reverse
transcription reactions were performed on total RNA isolated from 10.5 and 14.5
days post coitum (dpc) C57BL/6J mouse embryos. Briefly, 5 µg of total RNA
were heat-denatured at 95°C then split into two separate reactions. One reaction
served as a control and amplified a fragment of 28S ribosomal RNA. The
second reaction received one of two degenerate primer pools at a final
concentration of 2 µM. Reverse transcription was carried out at 42°C using 10
units M-MuLV reverse transcriptase (Boehringer Mannheim, Indianapolis, IN)
in a total volume of 25 µl.

Five microliters of each resultant first-strand cDNA were amplified in
separate 100 µl PCR reactions using combinations of degenerate primer pools
1 and 3 (DEG 1/3) or 1 and 5 (DEG 1/5). Amplification conditions were as
follows: 35 cycles of 94°C for 1 minute, 50°C for 1 minute, 72°C for 1 minute,
followed by a final extension of 72°C for 10 minutes. Primer pools were used at
a final concentration of 1 µM. Twenty microliters of each PCR reaction was
separated through a 2.0% agarose gel (Figure 1). All consistently amplified

prepared as described by Marchuk et al. (Nucleic Acids Res., 19, 1154 (1991)).
The resultant plasmids were subjected to restriction endonuclease and dideoxy
sequencing of double-stranded plasmid DNAs using a Sequenase Version 2.0
sequencing kit (United States Biochemical Corp, Cleveland, OH).

The 300 bp DEG 1/5 product (MHas300) and the 180 bp DEG 1/3
product were related by a common internal site for the restriction endonuclease
EcoRI, as shown below the gel image in Figure 1. Sequence analysis of the
other consistently amplified PCR products indicated that they were unrelated to
mouse HAS (Itano et al., J. Biol. Chem., 271, 9875 (1996)), hasA, DG42, nodC,
and the 180 bp and 300 bp PCR products.

The 300 bp cDNA fragment, MHas300, was utilized as a probe to screen
a primary λgt10 cDNA library constructed from 8.5 dpc C57BL/6J polyA+ RNA
(kindly provided by Dr. J. J. Lee, Mayo Clinic Scottsdale). The probe was
labeled to high specific activity using random-priming in the presence of
[α-32P]dCTP (Feinberg et al., Anal. Biochem., 132, 6 (1984)). Approximately
1.5 x 10^6 plaque-forming units (pfus) were screened using standard procedures
(Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
(1989)). Double positive plaques were identified and taken through two
additional rounds of plaque-purification. In addition, a portion of each primary
plaque was screened by PCR, employing a combination of primers that flanked
the λgt10 cloning site and MHas2 specific primers, to determine insert size
relative to the MHas300 cDNA fragment. Fourteen positive clones were obtained
and analyzed. The mouse λ cDNA library yielded multiple overlapping clones,
which collectively spanned approximately 3 kb (Figure 2). EcoRI restriction
fragments were then subcloned into pBluescript KSII+ for sequence analysis.
The nucleotide sequence of both strands was determined using synthetic
oligonucleotide primers made to the mouse Has2 sequence and to the vector.

Sequence analyses identified an open reading frame (ORF) of 1656 bp,
flanked by 5' and 3' untranslated regions (UTRs) of 507 and 772 bp,
respectively (Figure 3, SEQ ID NO:1). The open reading frame predicted a 63
kDa protein with several transmembrane sequences, multiple consensus
phosphorylation sites, and four putative hyaluronan binding motifs. The
predicted translation initiation site conformed to the Kozak consensus for
initiation (Kozak, Nucleic Acids Res., 12, 857 (1984)). Although there were four
additional upstream ATGs within the 5' UTR, none of these fitted the Kozak
consensus and all were followed closely by in-frame stop codons. The presence
of several upstream ATGs has, however, been more commonly described in
oncogenic sequences (Kozak, Nucleic Acids Res., 15, 8125 (1987)). The 3' UTR
contained two consensus sequences for polyadenylation, a CA repeat and a TA
repeat (Figure 3).
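
The region lengths reported above can be combined into coordinates along the
cDNA as a quick consistency check. This is a sketch under the assumption that
the sequence is numbered from the first base of the 5' UTR; the function name
is hypothetical.

```python
FIVE_PRIME_UTR_BP = 507    # from the text
ORF_BP = 1656              # from the text
THREE_PRIME_UTR_BP = 772   # from the text


def cdna_layout(utr5, orf, utr3):
    """Return 1-based inclusive (start, end) coordinates for each region.

    ASSUMPTION: numbering starts at the first base of the 5' UTR.
    """
    orf_start = utr5 + 1
    orf_end = utr5 + orf
    total = utr5 + orf + utr3
    return {
        "5' UTR": (1, utr5),
        "ORF": (orf_start, orf_end),
        "3' UTR": (orf_end + 1, total),
    }


layout = cdna_layout(FIVE_PRIME_UTR_BP, ORF_BP, THREE_PRIME_UTR_BP)
total_bp = layout["3' UTR"][1]   # summed length of the three regions
orf_codons = ORF_BP // 3         # number of whole codons in the ORF
```

On this numbering the three regions sum to 2935 bp, in line with the
approximately 3 kb spanned by the overlapping clones, and the ORF occupies
positions 508 to 2163.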

Database searches indicated that the predicted amino acid sequence of
mouse Has2 (SEQ ID NO:2) aligned most significantly with Xenopus DG42
(SEQ ID NO:10; 56% identity, 70% similarity; Rosa et al., Dev. Biol., 129, 114
(1987)), Streptococcal HasA (SEQ ID NO:11; 21% identity, 28% similarity;
DeAngelis et al., J. Biol. Chem., 268, 19181 (1993)), Rhizobium sp NodC (SEQ
ID NO:12; Jacobs et al., J. Bacteriol., 162, 469 (1985); Collins-Emerson et al.,
Nucleic Acids Res., 18, 6690 (1990)), and Saccharomyces cerevisiae chitin
synthase 2 (Chs2) (SEQ ID NO:13; Bulawa, Mol. Cell. Biol., 12, 1764 (1992))
(Figure 5). In addition, mouse Has2 displayed 55% identity and 73% similarity
to the recently reported mouse Has1 gene (SEQ ID NO:11, Itano et al., J. Biol.
Chem., 271, 9875 (1996)), and the human homologue of this gene (Yang et al.,
EMBO J., 13, 286 (1994)). Surprisingly, the deduced amino acid sequence of
the cDNA of Itano et al. is distinct from the Has2 cDNA described hereinbelow,
although the sequences are clearly related.

Clones for a second human Has gene, which shares greater than 90% amino
acid identity with mouse Has2 and thus is predicted to represent the human
Has2 gene, have also recently been isolated (SEQ ID NO:23). This
suggests that there are at least two related Has genes in both mouse and humans.
Investigation of the primary amino acid sequence of mouse Has2

(1994)), and numerous consensus sequences for phosphorylation by protein
kinase C (PKC) and cyclic-AMP dependent kinases, such as protein kinase A
(PKA) (Person et al., In: Protein Phosphorylation: A Practical Approach
(Hardie, D. G., ed), IRL Press at Oxford University Press, Oxford (1993)). Has2
is predicted to be a multiple membrane-spanning protein with a large
cytoplasmic loop, similar to the predicted structure of Streptococcus HasA and
mouse HAS (Has1) (Figure 6B). Sequence alignment of Has2 with
Saccharomyces cerevisiae chitin synthase 2 (Chs2; SEQ ID NO:13) (Figure 5)
demonstrated that the residues recently shown to be required for catalytic activity
in Chs2 (Nagahashi et al., J. Biol. Chem., 270, 13961 (1995)) are conserved
within the large predicted cytoplasmic loop of mouse Has2 (Figure 6B). It has
been suggested that these catalytic residues may be generally conserved within
glycosyltransferases that catalyze the synthesis of oligosaccharides with β1-4
linkages (Nagahashi et al., supra). Significantly, the predicted cytoplasmic loop
of the Has2 molecule is the most highly conserved across species, and thus this
part of the protein may form the catalytic domain.

Example 2

Molecular Biochemical Characterization of Mouse Has2

Northern and Southern Analysis. Mouse multiple tissue Northern (MTN)
Blots (CLONTECH, Palo Alto, CA) were hybridized to a [α-32P]dCTP-labeled
cDNA probe corresponding to the 1.65 kb open-reading-frame (ORF) of the
mouse Has2 gene. Blots were hybridized at 42°C and washed to high stringency
according to the manufacturer's recommendations. The mouse embryo blot was
exposed overnight at -70°C to BioMax MR film (Eastman Kodak Company,
New Haven, CT) with two intensifying screens, whereas the adult tissue blot was
exposed for six days at -70°C with two screens. To control for variation in
loading, both blots were stripped, and rehybridized with a mouse
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) probe. Both GAPDH
hybridized blots were exposed for one hour at -70°C with two screens.

Northern analyses detected two transcripts of approximately 3.2 kb and
4.8 kb, respectively, in embryonic samples (Figure 7). Only the 4.8 kb message
was observed in RNA from adult tissues. The 4.8 kb transcript was expressed at
levels approximately 20 fold higher than the 3.2 kb transcript. High levels of
expression were observed in the developing mouse embryo, in addition to lower
levels in adult mouse heart, brain, spleen, lung and skeletal muscle (Figure 7).
All of the isolated cDNA clones were predicted to form an identical ORF. Thus,
rather than being the result of alternate splicing, the 4.8 kb transcript most
probably corresponds to a mouse Has2 mRNA with an alternate polyA signal,
generating a 3' UTR with approximately 1.8 kb of sequence, in addition to that
reported herein.

Moreover, the observed expression pattern of mouse Has2, i.e., expression
detectable in the primitive streak stage embryo (7.5 dpc) and increasing in
the later embryo, correlates well with the previously described expression
pattern of HA. HA has previously been
observed at significant levels starting as early as the egg cylinder stage (5.5 dpc),
when it is secreted into the expanding yolk cavity. Thus, HA may play a role in
the formation and expansion of embryonic cavities. From 9.5 dpc, synthesis
increases, and the HA assumes more of a pericellular distribution, rather than
being primarily associated with fluid-filled spaces. HA is present at high levels
within the developing vertebral column, the neural crest-derived mesenchyme of
the craniofacial region, and the heart and smooth muscle throughout the
mid-gestation embryo.

In the adult, Has2 expression was detected in heart, brain, spleen, lung
and skeletal muscle, but not in liver or kidney (Figure 7). The level of
expression of Has2 was markedly reduced in adult tissues as compared to the
embryo.

Mouse 129Sv/J genomic DNA was prepared from tail snips using
standard procedures. Approximately 15 ng samples of genomic DNA were
digested overnight with restriction endonucleases, size-separated through 0.8%

cDNA probe corresponding to the 1.65 kb ORF of mouse Has2. Hybridization
was performed as recommended by the manufacturer. Membranes
were washed to low (1 X SSC + 0.1% SDS at 37°C) and high (0.1 X SSC +
0.1% SDS at 55°C) stringency (1 X SSC (saline sodium citrate) is 150 mM
NaCl, 15 mM Na citrate) and autoradiography was performed as described
above.
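
The salt concentrations in the two wash buffers follow directly from the 1 X
SSC definition given above. The sketch below is a worked calculation only; the
function name is hypothetical and SDS is left out because it is specified
separately as 0.1% in both washes.

```python
# 1 X SSC as defined in the text, in millimolar.
ONE_X_SSC_MM = {"NaCl": 150.0, "Na citrate": 15.0}


def ssc_composition(fold):
    """Return solute concentrations (mM) for a given fold-strength of SSC."""
    return {solute: round(conc * fold, 3) for solute, conc in ONE_X_SSC_MM.items()}


low_stringency = ssc_composition(1.0)    # 1 X SSC wash at 37°C
high_stringency = ssc_composition(0.1)   # 0.1 X SSC wash at 55°C
```

The high stringency wash therefore contains one tenth of the salt of the low
stringency wash (15 mM NaCl, 1.5 mM Na citrate) and is carried out at the
higher temperature.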

The pattern of hybridizing restriction fragments that was observed
through Southern analyses was consistent with mouse Has2 being a single copy
gene within the mouse genome (Figure 8). In addition, the pattern observed in
digests of total mouse genomic DNA was identical to that observed in equivalent
digests of recently isolated mouse Has2 genomic clones. Low stringency wash
conditions failed to identify any further hybridizing fragments including those
fragments corresponding to the related mouse Has1 (Itano et al., supra) gene.
This suggests that the level of sequence identity (55%) between mouse Has2 and
mouse Has1, and possibly other Has-related genes, is not sufficient to permit
detection through Southern hybridization even at low stringency. Thus, while
these results preclude the existence of a mouse Has2 pseudogene, they do not
preclude the existence of other genes related to mouse Has2 and mouse Has1.

Transfection Studies. To investigate the potential role of mouse Has2 in
HA biosynthesis, expression constructs were created in the mammalian
expression vector, pCIneo (Promega Corporation, Madison, WI). Mouse Has2
ORFs were amplified by PCR, from a template of mouse Has cDNA clone W 1 .1
(Figure 2). PCR primers were designed to create a mouse Has2 cDNA with an
optimized Kozak consensus A--ATGG, and to contain SmaI/XmaI sites at each
end suitable for cloning. Primers were as follows: 5'-CCCGGGCAAG ATG
GAT TGT GAG AGG TTT CTA TGT GTC CTG-3' (SEQ ID NO:21, bps 504
to 537, Figure 3) and 5'-CCCGGG TCA TAC ATC AAG CAC CAT GTC ATA
CTG-3' (SEQ ID NO:22, bps 2163 to 2137, Figure 3). Gel-purified PCR
products were cloned directly into a pBluescript KSII+ T-vector for sequence
verification, prior to subcloning into the XmaI site of pCIneo.

The mouse Has2 expression vector was co-transfected with a
cytomegalovirus promoter (CMV) driven β-gal expression vector into COS-1
(SV40-transformed African green monkey kidney) cells (Gluzman, Cell, 23, 175
(1981)) using Lipofectamine™ (GIBCO-BRL/Life Technologies, Gaithersburg,
MD), according to the manufacturer's instructions. The β-gal expression plasmid
was used in all transfections to permit the visual identification of cells that had
been successfully transfected. Control co-transfections were pCIneo (vector
control) and LacZ vector. Cells were analyzed 36 hours after lipofection
(transient transfection). The COS-1 cell line and the mouse 3T6 (Swiss
embryonic fibroblast) cell line were routinely maintained at 37°C in Dulbecco's
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum
(FBS) and 2 mM L-glutamine, in a humidified chamber at 5% CO2.

HA Coat Assays. Glutaraldehyde fixed horse erythrocytes (Sigma
Chemical Company, St. Louis, MO) were reconstituted in phosphate-buffered
saline (PBS), washed several times to remove traces of sodium azide, and finally
resuspended in PBS plus 1 mg/ml BSA to a density of 5 x 10^8 cells/ml. HA
coats were visualized around live cells growing in individual wells of a 24-well
plate or 6-well plate by adding 1 x 10^7 or 5 x 10^7 red blood cells, respectively, to
the growth medium. Red cells were allowed to settle for 15 minutes before HA
coats were scored. To confirm the coats as being composed of HA, red cells
were removed by extensive washing with PBS, and one well of each
experimental sample was treated with 10 units/ml bovine testicular
hyaluronidase (CALBIOCHEM, San Diego, CA) or 5 units/ml Streptomyces
hyaluronidase (CALBIOCHEM, San Diego, CA) in DMEM plus 0.5% FBS for
1 hour at 37°C. Equivalent wells were incubated under the same conditions in
the absence of hyaluronidase. After incubation, red cells were added to the
wells, as previously described, and coats were again scored. HA coats were
imaged at 200x magnification. After imaging, red cells were removed by
extensive washing with PBS. Cells were stained to detect β-galactosidase
(LacZ) activity and imaged as described by Sanes et al., EMBO J., 5, 3133



ability in HA pericellular coat-forming assays (Figure 9B). In contrast,
untransfected 3T6 mouse embryonic fibroblast cells had well-developed HA
coats (Figure 9A). Transient co-transfection of mouse Has2 and LacZ
expression constructs into COS-1 cells resulted in the production of large HA
coats (Figure 9D-I). Cells acquiring an HA coat also stained positively for β-gal
activity (Figure 9D-I), confirming that cells that had generated HA coats had
successfully taken up DNA. HA coats were destroyed by treatment with
Streptomyces hyaluronidase (Figure 9H) or bovine testicular hyaluronidase.
Control pCIneo transfected cells produced no coats (Figure 9C), and were
indistinguishable from parental untransfected COS-1 cells. Equivalent numbers
of LacZ positive cells were observed in experimental and control transfections.
These results indicate that parental COS-1 cells express all other factors
required for HA biosynthesis and pericellular coat formation, but most likely
lack HA synthase activity. Thus, expression of Has2 in COS-1 cells is sufficient
for HA coat formation.

Discussion 

Residues demonstrated to be critical for the β1-4
glycosyltransferase activity of yeast Chs2 were conserved in mouse Has2, mouse
Has1, Streptococcal HasA, Xenopus DG42 and Rhizobium NodC. Thus, it is
likely that mouse Has proteins have β1-4 glycosyltransferase activity.
Furthermore, although overall sequence identity between mouse Has2 and
Streptococcus pyogenes HasA was only 21%, a 180 amino acid region within the
predicted intracellular loop (residues 182 to 361) was highly conserved. This
region exhibited 54% similarity between mouse Has2 and bacterial HasA, and
greater than 80% similarity between mouse Has2, mouse Has1, and Xenopus
DG42. This level of sequence conservation suggests that these proteins are
functionally related.
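
Identity percentages such as those quoted above depend on which aligned positions are counted. As a minimal sketch, assuming a pairwise alignment is already available as two gapped strings of equal length, percent identity could be computed as follows. The function name, the toy sequences, and the choice to exclude gapped columns from the denominator are illustrative assumptions and are not necessarily the procedure used to obtain the values reported here.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns in which neither sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # gapped columns are skipped (an assumption of this sketch)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Hypothetical aligned fragments, for illustration only.
toy_a = "GWGTSGRK-TIV"
toy_b = "GWGTSNRKATLV"
toy_identity = percent_identity(toy_a, toy_b)
print(f"{toy_identity:.1f}% identity")
```

Different alignment programs use different denominators (for example, the length of the shorter sequence or the full alignment length), so figures from separate tools are not always directly comparable.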

Sequence analyses predicted that mouse Has2 and Has3 encode a
membrane protein with multiple transmembrane domains, similar in structure to
the bacterial HasA protein and mouse Has1. Significantly, four consensus
binding sites for HA were identified in Has2, three of which were predicted to be
intracellular. These sites may thus represent areas of potential binding of HA
chains during elongation, and/or may represent sites at which the newly
synthesized HA polymer remains attached prior to release from the cell. In
addition to putative HA binding sites, numerous consensus sequences for
phosphorylation by PKC and cAMP-dependent kinases were identified within
the predicted intracellular loop of the molecule. This is significant, as
mammalian HA biosynthesis has been shown to be dependent on activation by
PKC, and suggests that the PKC dependence may partly involve direct activation
of Has2 through phosphorylation.

HA-dependent pericellular coats have been proposed to form through two
alternative mechanisms. The first mechanism is HA receptor-dependent and HA
synthesis-independent. This type of coat can form through association of HA
with cell surface HA receptors, and stabilization of the coat by association of HA
binding proteoglycans, such as aggrecan and link protein (Lee et al., J. Cell
Biol., 123, 1899 (1993); Knudson et al., Proc. Natl. Acad. Sci. USA, 90, 4003
(1993)). Presumably, this permits cells expressing HA receptors to enter an
environment rich in HA, and to organize an HA matrix around themselves that is
independent of the ability to synthesize HA.

The second mechanism is HA receptor-independent, and requires the
synthesis and extrusion of HA through the plasma membrane. It has been
proposed that the extruded HA associates with the membrane through continued
attachment to the synthase, and that this coat is stabilized by HA-HA and
HA-protein bridges (Heldin et al., Exp. Cell Res., 208, 422 (1993)).

Expression of mouse Has2 by COS-1 cells resulted in the formation of
large well-pronounced HA coats, as determined by a particle exclusion assay
(Figure 9). Previous studies in COS cells have shown that transfection of the
HA receptor, CD44, and the addition of exogenous HA (15 μg/ml) and
proteoglycans to the medium were required for HA-dependent pericellular matrix
formation (Knudson et al., Proc. Natl. Acad. Sci. USA, 90, 4003 (1993)). In




Has2 in COS cells, in the absence of HA receptor expression, exogenously added
HA, or proteoglycans, was sufficient for HA coat formation. This suggests that
Has2 expression leads to the synthesis of HA, which is extruded through the
plasma membrane and may associate with the cell surface to form an HA coat
through continued attachment to the synthase. In this respect, the consensus HA
binding motifs predicted within mouse Has2 may play an important role.

HA biosynthesis requires two enzyme activities: the transfer of
UDP-N-acetylglucosamine (UDP-GlcNAc) and UDP-glucuronic acid (UDP-GlcUA),
respectively, to the growing HA chain (Philipson et al., Biochemistry, 24, 7899
(1985)). In S. pyogenes, a single enzyme, HasA, carries out both activities. In
contrast, recombinant Xenopus DG42 protein can synthesize short chitin
oligomers from UDP-GlcNAc in vitro, but cannot synthesize a hyaluronan chain
in the presence of UDP-GlcNAc and UDP-GlcUA (Semino et al., Proc. Natl.
Acad. Sci. USA, 92, 3498 (1995)). This suggests that eukaryotic HA synthesis
requires DG42-like activity and a second enzyme activity provided by a separate
protein.

Example 3

cDNA Cloning and Characterization of Human Hyaluronan Synthase-2 and
Mouse and Human Hyaluronan Synthase-3

Using degenerate PCR primer pair DEG 1 and DEG 5, described in
Example 1, PCR products of approximately 300 bp were amplified from human
and mouse total genomic DNA. The templates for PCR were 100 ng of human
T47D mammary carcinoma cell line genomic DNA, and 100 ng of mouse 129
Sv/J genomic DNA. Cycling parameters were as follows: 35 cycles of 94°C for
10 seconds, 50°C for 30 seconds, and 72°C for 1 minute, followed by a final
extension step at 72°C for 10 minutes. Amplified fragments of the expected size
were identified through agarose gel electrophoresis, gel-purified, and cloned
directly as described in Example 1.

Two additional degenerate oligonucleotide primer pools (DEG 10 and
DEG 11) were designed, based upon the conserved amino acid sequences
GWGTSGRK (SEQ ID NO:20) and RWLNQQTRW (SEQ ID NO:33) (see
Figure 14). Similar PCR conditions were used to amplify fragments of the
expected size from human and mouse genomic DNA using these degenerate
primers. Amplified PCR products were gel-purified and ligated directly into a
cloning vector for sequence analyses.
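
To illustrate why primers derived from conserved peptides are synthesized as degenerate pools, the sketch below counts how many distinct nucleotide sequences could encode each of the two peptides named above under the standard genetic code. It is an upper bound only: the actual composition of the DEG 10 and DEG 11 pools is not given here, and practical designs usually shorten the primer or restrict wobble positions to lower the degeneracy. The table and function names are illustrative.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def backtranslation_degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for a peptide (full back-translation)."""
    return prod(CODON_COUNTS[residue] for residue in peptide.upper())


for peptide in ("GWGTSGRK", "RWLNQQTRW"):
    print(peptide, backtranslation_degeneracy(peptide))
```

Tryptophan (one codon) contributes nothing to the degeneracy, which is one reason tryptophan-containing motifs are attractive primer targets.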

Sequences obtained from the clones fell into two groups in both the
mouse and human. One group of human clones, represented by SEQ ID NO:23,
shared 88% sequence identity with the equivalent region of mouse Has2 (SEQ
ID NO:1) (Figure 10C), and was 100% identical at the amino acid level to SEQ
ID NO:2 (Figure 10D). Thus, SEQ ID NO:23 represents a partial nucleotide
sequence of human Has2. A human fetal lung expressed sequence tag (EST)
(Genbank Accession No. W21505) shares approximately 90% nucleotide
sequence identity with SEQ ID NO:1, and close to 100% amino acid identity to
the predicted carboxy-terminal end of SEQ ID NO:2.

The second group of clones obtained through degenerate PCR, although
clearly related to Has2 and Has1, was unique. The gene present in these clones
has been designated Has3 (Figure 11). The mouse and human Has3 genes share
93% nucleotide identity (SEQ ID Nos. 26 and 25, respectively) and 99% amino
acid identity (SEQ ID Nos. 28 and 27, respectively).

Based upon the sequence of these partial fragments, a single pair of
oligonucleotide primers, forward 5'-TAC TGG ATG GCT TTC AAC GTG
GAG-3' (corresponding to nucleotides 790 to 813, SEQ ID NO:34, Figure 12B),
and reverse 5'-GTC ATC CAG AGG TGG TGC TTA TGG-3' (corresponding to the
antisense complement of nucleotides 1142 to 1119, SEQ ID NO:37, Figure 12B),
was employed to facilitate PCR screening of a mouse 129Sv P1 genomic library
(Genome Systems, St. Louis, MO). Three positive P1 clones were obtained.
Restriction fragments spanning the entire mouse Has3 gene were identified,
subcloned into pBluescript (Stratagene, La Jolla, CA)-based vectors, and
sequenced.
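
The primer coordinates quoted above can be checked with a short calculation. The sketch below, with illustrative function names, confirms that each primer is 24 nucleotides long (matching the 790-813 and 1142-1119 ranges), derives the sense-strand sequence targeted by the reverse primer, and computes the span between the primers' 5' ends on the cDNA numbering. The span applies to an intron-free template; a product amplified from genomic DNA would be longer if an intron lies between the primers.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(forward_5prime: int, reverse_5prime: int) -> int:
    """Inclusive span between the 5' ends of the two primers (1-based coordinates)."""
    return reverse_5prime - forward_5prime + 1


forward = "TACTGGATGGCTTTCAACGTGGAG"  # nucleotides 790 to 813
reverse = "GTCATCCAGAGGTGGTGCTTATGG"  # antisense complement of 1142 to 1119

print(len(forward), len(reverse))        # primer lengths
print(reverse_complement(reverse))       # sense-strand sequence at 1119 to 1142
print(amplicon_length(790, 1142))        # expected span on the cDNA
```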



To confirm the sequence obtained from the analysis of genomic clones,
the Has3 cDNA was obtained. The cDNA was cloned by reverse-transcriptase




C57BL/6J embryos. First-strand cDNA synthesis was performed as described in 
Example 1 using the mouse Has3 reverse oligonucleotide primer. 

First-strand cDNAs were PCR amplified using standard PCR buffer
conditions supplemented with 2% deionized formamide, through 35 cycles of
94°C for 10 seconds, 65°C for 30 seconds, and 72°C for 2 minutes, followed by
a final extension step of 72°C for 10 minutes. Oligonucleotide primers
possessed EcoRI restriction endonuclease sites (GAATTC) at their 5' termini to
facilitate subsequent cloning steps. These oligonucleotides included: forward,
5'-CC GAATTC AAG ATG GCG GTG CAG CTG ACT ACA GCC-3'
(corresponding to nucleotides 1 to 24, SEQ ID NO:38, Figure 12B), and
reverse, 5'-CC GAATTC TCA CAC CTC CGC AAA AGC CAG GC-3'
(corresponding to the antisense complement of nucleotides 1665 to 1643, SEQ
ID NO:39, Figure 12B). Amplified cDNAs of the expected size were gel-purified
and cloned. All sequence analyses were performed using the Genetics
Computer Group (GCG) package, and MacVector programs.

The open reading frame (ORF) encoding mouse Has3 is 1662 bp (SEQ
ID NO:31) (Figure 12B). This ORF encodes a polypeptide of 554 amino acids
(SEQ ID NO:32) with a predicted molecular mass of 63.3 kDa. This polypeptide
is only 2 amino acids longer than the mouse Has2 polypeptide. Sequence
alignments indicated that mouse Has3 is 71%, 57%, 56%, and 28% identical to
mouse Has2, mouse Has1 (HAS protein), Xenopus DG42, and Streptococcus
pyogenes HasA, respectively (Figure 13A). As in Has1 and Has2, residues
demonstrated to be critical for N-acetylglucosaminyltransferase activity of yeast
chitin synthase 2 are completely conserved. In addition, these residues are
conserved with members of a recently identified putative plant cellulose synthase
family (Pear et al., Proc. Natl. Acad. Sci. USA, 92, 12637 (1996)) (Figure 13B).
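
The lengths quoted above are mutually consistent, as the short check below shows. It assumes, as the numbers imply, that the 1662 bp figure excludes the stop codon; the constant and function names are illustrative. The Has2 length of 552 residues is taken from SEQ ID NO:2.

```python
def codons_in_orf(orf_length_bp: int) -> int:
    """Number of codons in an ORF whose length is a whole number of triplets."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return orf_length_bp // 3


HAS3_ORF_BP = 1662        # mouse Has3 ORF, stop codon assumed excluded
HAS2_RESIDUES = 552       # length of SEQ ID NO:2

has3_residues = codons_in_orf(HAS3_ORF_BP)
length_difference = has3_residues - HAS2_RESIDUES
stop_codon_end = HAS3_ORF_BP + 3  # last base of the stop codon

print(has3_residues, length_difference, stop_codon_end)
```

The position of the stop codon's last base agrees with the 3' coordinate (nucleotide 1665) quoted for the reverse cloning primer.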

Alignment of the partial sequence of human Has3 (HAS3 hereinafter) and
mouse Has3 (Has3 hereinafter) indicated a very high level of sequence
conservation (99%) (Figure 12A). This is similar to the high level of
conservation observed for human and mouse HAS1 (96%) and HAS2 (99%).

Hydrophilicity plots suggested that Has3 is very similar in structure to
Has2 and Has1, and predicted the presence of multiple transmembrane domains,
with two at the N-terminus and a cluster at the C-terminus (Figure 14C).
Significantly, like Has2 and Has1, the Has3 sequence predicts the presence of
several potential HA binding motifs defined by the consensus B(X7)B
(underlined in Figure 12B). Furthermore, these motifs are located at similar
positions within the Has3 polypeptide.
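
A motif of the form B(X7)B can be located with a simple pattern scan. The sketch below is a simplified reading of the consensus: it assumes B is arginine or lysine and places no restriction on the seven intervening residues, which may be looser than the definition used to annotate Figure 12B. The toy sequence is hypothetical and is not a Has sequence.

```python
import re

# Lookahead so that overlapping candidate motifs are all reported.
BX7B = re.compile(r"(?=([RK].{7}[RK]))")


def find_bx7b(protein: str):
    """Return (1-based start position, matched 9-residue window) for each candidate."""
    return [(m.start() + 1, m.group(1)) for m in BX7B.finditer(protein.upper())]


toy_protein = "AKGSTLVAGRAAA"  # hypothetical sequence for illustration
toy_hits = find_bx7b(toy_protein)
print(toy_hits)
```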

Example 4 

Molecular Biochemical Characterization of Mouse Has3

Northern Analysis. To determine the temporal expression pattern of
mouse Has3 in the developing mouse embryo, Northern blot analysis was
employed. The mouse Has3 ORF cDNA was labeled with [α-32P]dCTP by
random priming (Feinberg and Vogelstein, Anal. Biochem., 132, 6 (1984)) and
hybridized to a Northern blot of mouse embryo messenger RNA (CLONTECH,
Palo Alto, CA) under conditions recommended by the manufacturer. The results
showed that, in contrast to mouse Has2, which is highly expressed from as early
as day 7.5 post-coitum through late gestation in the developing mouse embryo,
mouse Has3 is expressed predominantly in the late gestation embryo (Figure 13).
One major transcript of approximately 6.0-6.5 kb and a minor transcript of
approximately 4.0 kb were observed (Figure 13).

Transfection Studies. The mouse Has3 ORF was cloned into the EcoRI
site of the expression vector pCIneo (Promega, Madison, WI). To test the
enzyme activity of mouse Has3, the mouse Has3 expression vector was
co-transfected with a pCMV β-gal vector into COS-1 (SV40-transformed African
green monkey kidney) cells using LipofectAMINE™ (Life Technologies Inc.,
Gaithersburg, MD), according to the manufacturer's instructions. Positive
control transfections utilized the mouse Has2 expression vector described above.
HA coat assays and detection of β-galactosidase activity were performed as
described in Example 2.



pCIneo (vector only control) transfected cells failed to produce coats




(5 TRU/ml for 1 hour at 37°C) (compare panels E, before hyaluronidase 
treatment, and F, after hyaluronidase treatment, in Figure 15). In contrast, 



pericellular coats remained on mock hyaluronidase-treated cells (compare panels
C, before, and D, after mock hyaluronidase treatment, in Figure 15). Thus, the
data showed that expression of mouse Has3 in COS-1 cells resulted in the
generation of well-pronounced HA-dependent pericellular coats, as previously
observed for Has2.

To confirm the HA biosynthetic capability of Has3-transfected cells, HA
synthase assays were performed on crude membranes prepared from these cells.
Crude cell membrane preparations were isolated as described by Becq et al.
(Proc. Natl. Acad. Sci. USA, 91, 9160 (1994)), except the final membrane pellets
were resuspended in 50 μl of lysis buffer (LB) consisting of 10 mM KCl, 1.5
mM MgCl2, and 10 mM Tris-HCl pH 7.4 plus protease inhibitors (aprotinin,
leupeptin and phenylmethylsulfonyl fluoride) (LB+). Protein content of crude
membrane preparations was determined by a BCA assay (Pierce, Rockford, IL).
To detect HA synthase activity, duplicate samples of approximately 100 μg
crude membrane protein were incubated overnight at 37°C in a total reaction
volume of 200 μl under the following conditions: 5 mM dithiothreitol, 15 mM
MgCl2, 25 mM HEPES pH 7.1, 1 mM UDP-GlcNAc, 0.05 mM UDP-GlcUA,
0.4 ng aprotinin, 0.4 μg leupeptin, 0.5 μCi UDP-[14C]GlcUA (ICN, Costa Mesa,
CA). An additional specificity control reaction was set up in which UDP-GlcNAc
was omitted. After overnight incubation, samples were boiled for 10
minutes, and subsequently divided into two equal portions. Streptomyces
hyaluronidase (1 turbidity reducing unit (TRU)) was added to one half and
incubated for an additional hour at 37°C. SDS was added to a final
concentration of 1%, samples were boiled and analyzed by descending paper
chromatography essentially as described in DeAngelis and Weigel,
Biochemistry, 33, 9033 (1994).
These assays indicated that crude membranes prepared from either Has3-
or Has2-transfected COS-1 cells were capable of converting UDP-[14C]GlcUA
into significant amounts of a high molecular weight product only in the presence
of UDP-GlcNAc (Table 2). Furthermore, this product could be specifically
degraded by Streptomyces hyaluronidase (Table 2). Thus, in COS-1 cells, Has2
and Has3 appear to possess similar enzymatic activities.

TABLE 2

Hyaluronan Synthase Activity of Transfected COS-1 Cells

Vector              + UDP-GlcNAc (a)   - UDP-GlcNAc   Hyaluronidase (b)
Mouse Has3          204.2 (c)          1.9 (d)        -
pCIneo              65.0               2.2            +
Mouse Has2          26.9               2.5            -
pCIneo              10.5               2.0            +
pCIneo (control)    11.0               ND (e)         -
                    10.3               ND             +

(a) Plus and minus symbols indicate whether or not UDP-GlcNAc was included
    in these reactions.
(b) Plus and minus symbols indicate whether or not a reaction was subsequently
    treated for 1 hour at 37°C with 1 TRU Streptomyces hyaluronidase prior to
    paper chromatography.
(c) Numbers represent picomoles radiolabeled product formed and were
    calculated taking into account the specific activity of the UDP-[14C]GlcUA
    used, the amount of cold UDP-GlcUA per reaction, and assumed a
    scintillation counting efficiency of >95%. Based upon these calculations, 1
    picomole of radiolabeled product is represented by 384 disintegrations per
    minute (dpm), i.e., 204.2 picomoles product was calculated from 78,413
    dpm. Numbers represent the mean calculated from duplicate reactions.
(d) Number represents the result of a single reaction in each instance.
(e) Not determined.
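
The conversion described in the footnote to Table 2 is a single division. The sketch below reproduces the worked figure given there, using the stated factor of 384 dpm per picomole; the function and constant names are illustrative, and the factor itself is taken as given rather than re-derived from the specific activity.

```python
DPM_PER_PMOL = 384.0  # conversion factor stated in the Table 2 footnote


def dpm_to_pmol(dpm: float, dpm_per_pmol: float = DPM_PER_PMOL) -> float:
    """Convert measured disintegrations per minute to picomoles of product."""
    if dpm_per_pmol <= 0:
        raise ValueError("conversion factor must be positive")
    return dpm / dpm_per_pmol


has3_pmol = dpm_to_pmol(78413)
print(f"{has3_pmol:.1f} pmol")
```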




Discussion. The three Has proteins are encoded by three separate but
related genes, which constitute a mammalian HAS gene family. Sequence
comparisons and structural predictions suggest that the mammalian HAS
proteins are very similar in structure. They are predicted to have one or two
N-terminal transmembrane domains and a cluster of C-terminal transmembrane




Glycobiology, 6, 741 (1996)), and to that recently reported for the Rhizobium
meliloti nodulation factor, NodC (Barny et al., Molec. Microbiol., 19, 443
(1996)). In addition, the mammalian HAS sequences, the Xenopus DG42
sequence, HasA sequence, NodC sequence, and the recently reported putative
plant cellulose synthases share critical residues shown to be required for
N-acetylglucosaminyltransferase activity of yeast chitin synthase 2, making it
highly likely that all these proteins are functionally related processive
β-glycosyltransferases. The highly conserved aspartate residues may represent
sites such as cation binding sites that in turn may coordinate nucleotide-sugar
interaction with the enzyme.

Semino and Robbins have postulated that DG42 and its related
mammalian homologs, rather than being bona fide HA synthases, may stimulate
HA production through synthesizing chitin oligosaccharide primers, which are
required for and rate limiting for eukaryotic HA biosynthesis (Proc. Natl. Acad.
Sci. USA, 93, 4548 (1996)). However, cell membranes isolated from baker's yeast,
Saccharomyces cerevisiae, engineered to express DG42 have HA synthesis
activity in vitro when supplied with the required UDP-precursors (DeAngelis
and Achyuthan, J. Biol. Chem., 271, 23657 (1996)). Since S. cerevisiae is
deficient in UDP-glucuronic acid production, it is incapable of HA
biosynthesis.

Expression of any one of the mammalian HAS proteins in transfected
mammalian cells leads to a dramatic increase in HA biosynthesis. This would
suggest that the proteins have similar activities. However, the high degree of
sequence conservation (96-99% identity) between human and mouse HA
synthases contrasts with the lower level of identity between synthases within a
species (Has1/Has2, 55% identity; Has1/Has3, 57% identity; Has2/Has3, 71%
identity), arguing for evolutionary conservation of functionally important
residues, and for some differences in the mode of action of the three proteins.
Potential differences in function of the proteins could relate to the length of the
HA chain synthesized, the rate of HA synthesis, the ability to interact with
cell-type specific accessory proteins, and whether or not the HA is preferentially
secreted by the cell or alternatively retained by the cell in the form of a
pericellular coat.

Example 5

Identification of the Chromosomal Location of the Has Genes

To determine the chromosomal location of the mouse Has genes, a panel
of DNA samples, from an interspecific cross that has been characterized for over
2,000 genetic markers throughout the mouse genome, was analyzed. The genetic
markers included in this genetic map span between 50 and 80 centi-Morgans
(cM) on each mouse autosome and the X chromosome (Chr), and the mapping of
the reference loci in this interspecific cross is indicated with citations in an
online database (data can be accessed through the internet as follows:
http://www.informatics.jax.org/crossdata.html to enter the DNA Mapping Panel
Data Sets from the Mouse Genome Database (MGD), then select the Seldin cross
and Chromosome).

Initially, DNAs from two parental mice [C3H/HeJ-gld and (C3H/HeJ-gld
x Mus spretus)F1] were digested with various restriction endonucleases and
hybridized with probes specific to mouse Has1, Has2 and Has3 to determine
restriction fragment length variants (RFLVs) to allow haplotype analyses. The
223 bp mouse Has1 probe was generated through PCR amplification of a
full-length mouse Has1 cDNA template using oligonucleotide primers,
5'GTCAGAGCTACTTCCACTGTG3' (SEQ ID NO:53) and
5'AAGGAGGAGGGCGTCTCCGAG3' (SEQ ID NO:54) (nt positions 947-967
and 1169-1149, respectively). The mouse Has2 probe was the MHas300 partial
cDNA (Figure 2), and the mouse Has3 probe was an equivalent fragment of the




mouse Has3 gene, generated using degenerate PCR primers as described above
(Example 1). For each gene, informative RFLVs were detected: Has1 using
BamHI restriction endonuclease, C3H/HeJ-gld, 18.0 kb, 6.8 kb; Mus spretus, 2.1
kb; Has2 using TaqI restriction endonuclease, C3H/HeJ-gld, 3.7 kb; Mus
spretus, 3.9 kb; Has3 using MspI restriction endonuclease, C3H/HeJ-gld, 1.3 kb,



these genes segregated to three different mouse autosomes: Has1 to mouse Chr
17, Has2 to mouse Chr 15, and Has3 to mouse Chr 8. The best gene order ± the
standard deviation (Green, In: Genetics and Probability in Animal Breeding
Experiments (E. Green, ed.), MacMillan, NY, pp. 77-113 (1981)) indicated the
following gene orders: on mouse Chr 17 (centromere) Thbs2 - 0.9 cM ± 0.9 cM
- Has1 - 3.5 cM ± 1.7 cM - Hsp84-1; on mouse Chr 15 (centromere) Dhfr-rs1 -
14.0 cM ± 3.3 cM - Has2 - 0.9 cM ± 0.9 cM - Myc; and on mouse Chr 8
(centromere) Mt1 - 5.3 cM ± 2.1 cM - D8Mit242 - 0.9 cM ± 0.9 cM -
Has3/D8Mit12 - 11.4 cM ± 3.0 cM - D8Mit154.
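
Map distances of the form "distance ± standard deviation" are commonly obtained from the number of recombinants observed between two loci. The sketch below uses the binomial estimate, with the distance taken as the recombination fraction and the standard deviation as sqrt(r(1 - r)/n). The panel size of 114 meioses and the recombinant counts are assumptions chosen for illustration because they reproduce several of the intervals quoted above; neither figure is stated in the text, and the function name is hypothetical.

```python
from math import sqrt


def map_interval_cm(recombinants: int, meioses: int):
    """Return (distance, standard deviation) in cM from a recombinant count."""
    if not 0 <= recombinants <= meioses or meioses == 0:
        raise ValueError("invalid counts")
    r = recombinants / meioses
    sd = sqrt(r * (1.0 - r) / meioses)
    return 100.0 * r, 100.0 * sd


ASSUMED_MEIOSES = 114  # assumption for illustration, not given in the source

for recombinants in (1, 4, 16):
    distance, sd = map_interval_cm(recombinants, ASSUMED_MEIOSES)
    print(f"{recombinants:2d} recombinants: {distance:.1f} cM ± {sd:.1f} cM")
```

For closely linked markers the recombination percentage approximates the distance in cM; over longer intervals a mapping function would normally be applied.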

Pairwise sequence alignments of mouse Has cDNAs with human HAS
cDNAs permitted the design of oligonucleotide primer pairs specific for the
respective human HAS genes. Human HAS1: HAS1F
5'GTGCTTCTGTCGCTCTACGCG3' (SEQ ID NO:49) and HAS1R
5'CCAGTCCCAATATAGTCCAGACTG3' (SEQ ID NO:50) (nt positions
1410-1431 and 1940-1917, respectively, Shyjan et al., J. Biol. Chem., 271,
23395 (1996)), which amplified a 520 bp fragment. Human HAS2: HAS2F
5'GGTGTGTTCAGTGCATTAGTGGA3' (SEQ ID NO:51) and HAS2R
5'TAGCCATCTGAGATATTCTATAGGT3' (SEQ ID NO:52) (nt positions
1359-1382 and 1579-1555, respectively, Watanabe and Yamaguchi, J. Biol.
Chem., 271, 22945 (1996)), which amplified a 220 bp fragment. Human HAS3:
HAS3F 5'TGTGCAGTGTATTAGTGGGCCCT3' (SEQ ID NO:41) and
HAS3R 5'GTTGAGCCACCGGAGGTACTTAG3' (SEQ ID NO:43), which
amplified a 220 bp fragment. Conditions used in all PCR reactions were: 0.2
mM each dNTP, 50 mM KCl, 10 mM Tris-HCl pH 8.3, 1.5 mM MgCl2, 2%
deionized formamide, 0.25 U Taq polymerase (Boehringer Mannheim), primers
at 0.4 μM, 100 μl reactions. Cycling parameters for each primer pair were as
follows: 35 cycles of 94°C for 10 seconds, 67°C (HAS1), 63°C (HAS2), or
65°C (HAS3) for 30 seconds, and 72°C for 1 minute, followed by a final
extension step at 72°C for 10 minutes.
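
The cycling parameters above differ between primer pairs only in annealing temperature. One way to record them, sketched below with hypothetical class and field names, is a small immutable record per gene; the helper sums the programmed hold times and ignores instrument ramping, so the real run time would be longer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CyclingProgram:
    cycles: int
    denature_s: int          # 94°C step
    anneal_s: int            # gene-specific annealing step
    extend_s: int            # 72°C step
    final_extension_s: int   # final 72°C hold
    anneal_temp_c: int

    def hold_time_s(self) -> int:
        """Total programmed hold time in seconds, excluding temperature ramps."""
        per_cycle = self.denature_s + self.anneal_s + self.extend_s
        return self.cycles * per_cycle + self.final_extension_s


ANNEAL_TEMP_C = {"HAS1": 67, "HAS2": 63, "HAS3": 65}
PROGRAMS = {
    gene: CyclingProgram(35, 10, 30, 60, 600, temp)
    for gene, temp in ANNEAL_TEMP_C.items()
}

for gene, program in PROGRAMS.items():
    print(gene, program.anneal_temp_c, program.hold_time_s())
```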

The oligonucleotide primers were used to screen two somatic cell hybrid
mapping panels (Coriell Institute, Camden, NJ) segregating human
chromosomes on a mouse or hamster background. Using this approach, the
human HAS genes were unequivocally assigned to human Chr 19 (HAS1), Chr 8
(HAS2), and Chr 16 (HAS3).
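
Chromosome assignment from a somatic cell hybrid panel rests on concordance: the gene is assigned to the chromosome whose presence or absence matches the PCR result in every hybrid. The sketch below illustrates that logic only. The four-hybrid panel, its chromosome content, and the PCR results are hypothetical and do not represent the Coriell panels or the data obtained here.

```python
def concordant_chromosomes(panel, pcr_positive):
    """Chromosomes whose retention matches the PCR result in every hybrid."""
    all_chromosomes = set().union(*panel.values())
    hits = []
    for chromosome in sorted(all_chromosomes):
        if all(
            (chromosome in retained) == pcr_positive[hybrid]
            for hybrid, retained in panel.items()
        ):
            hits.append(chromosome)
    return hits


# Hypothetical panel: hybrid name -> human chromosomes retained.
toy_panel = {
    "hybrid_A": {"8", "16"},
    "hybrid_B": {"16", "19"},
    "hybrid_C": {"8", "19"},
    "hybrid_D": {"X"},
}
# Hypothetical PCR results for one primer pair.
toy_pcr = {"hybrid_A": True, "hybrid_B": False, "hybrid_C": True, "hybrid_D": False}

toy_assignment = concordant_chromosomes(toy_panel, toy_pcr)
print(toy_assignment)
```

An unequivocal assignment corresponds to exactly one chromosome with no discordant hybrids.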

To refine the location of human HAS1 on Chr 19, the PCR fragment
described above was used as a probe to screen colony filters of a Chr 19 cosmid
library (Olsen et al., Genomics, 22, 659 (1994)). Two positive clones, R30674
and F21560, were identified, neither of which had been incorporated into any of
the previously assembled contigs constituting the Chr 19 map (Ashworth et al.,
Nat. Genet., 11, 422 (1995)). Alu-PCR products (Parrish et al., Am. J. Hum.
Genet., 57(5), 267 (1995)) from clone F21560 were hybridized to the cosmid
library and to a genomic Bacterial Artificial Chromosome (BAC) library
(Shizuya et al., Proc. Natl. Acad. Sci. USA, 89, 8794 (1992)) to form a contig
around the HAS1 gene. The probe identified several additional cosmids that
were members of a previously assembled contig (CT1665), which had been in
situ mapped to 19q13.3, as well as two BACs (BC79672 and BC56224) which
extended the HAS1 contig in the opposite direction from CT1665. Alu-PCR
products from BC56224 were hybridized to cosmids and identified the HAS1
cosmids in addition to numerous clones from another previously assembled
contig (CT1031). Clone D1852 from this contig has been incorporated into the
high resolution pronuclear FISH map of human 19q, placing HAS1 at the
q13.3-13.4 boundary, within the approximately 400 kb region between ETFB
(Electron-Transferring-Flavoprotein, Beta polypeptide) and FPR1 (Formyl
Peptide Receptor 1). EcoRI mapping confirmed the clone overlaps detected by
hybridization and indicated a size of 286 kb for the extended HAS1 contig. In
addition to the above mapping results, the localization of HAS1 to Chr
19q13.3-13.4 was confirmed using a 2.1 kb human HAS1 cDNA (Itano et al., BBRC,
222, 816 (1996)) and FISH analysis, as described in Inazawa et al. (Genomics,
17, 153 (1993)). The mapping results for mouse Has1 and human HAS1
reinforce the recently reported relationship between a small region of human 19q



human homolog, HAS2, is located on the long arm of human Chr 8 at band q24.1
(DeBry and Seldin, Genomics, 33, 337 (1996) and online database:
http://www3.ncbi.nlm.nih.gov/Homology/). This location corresponds to the
region predicted to contain the gene for the human Langer-Giedion syndrome
(LGS) (Chen et al., Genomics, 12, 117 (1996)), a contiguous genetic syndrome
characterized by craniofacial deformities, multiple exostoses, mental retardation,
microcephaly, and redundant skin (Bauermeister and Letts, Orthop. Rev., 21, 31
(1992)). To refine the location of human HAS2 on Chr 8, the human HAS2
primers were used to PCR screen the following human-hamster somatic cell
hybrids: CL-17, 3;8/4-1, MC2F, 21q+, and TL/UC (Parrish et al., Som. Cell
Molec. Genet., 20, 143 (1994); Wagner et al., Genomics, 10, 114 (1991)).
Positive PCR signals were observed for CL-17, 21q+ and 3;8/4-1 in addition to
total human DNA, sublocalizing the HAS2 gene to the q arm in interval 1-8
(Spurr et al., Cytogenet. Cell Genet., 68, 147 (1995)). Human HAS2 primers
were further screened against YACs within the distal portion of a large YAC
contig (Chen et al., supra). This contig extends from interval 1-1 into interval
1-9. Only three of the YACs tested were positive, narrowing the location of HAS2
to the overlapping region between these YACs. This places the human HAS2
gene at human Chr 8q24.12, close to the DAP-A1 gene, and between the defined
critical region for the Langer-Giedion syndrome (LGS) and the MYC gene.
Thus, HAS2 can be excluded as a candidate gene for LGS.

The localization of the mouse Has3 gene to mouse Chr 8 near the
D8Mit12 locus implicated human Chr 16q as the most likely location for the
human homolog of this gene. To confirm and refine this localization, YAC DNA
pools from a YAC map of human Chr 16 (Doggett et al., Nature, 377(5), 335
(1995)) were screened with DNA primers that were specific for the human HAS3
gene, as described above. Three YACs (My782G9, My703C5, and My878A4)
were identified which produced an amplimer of the correct size with these
primers. These results place the HAS3 gene in band 16q22.1 between the
somatic cell hybrid breakpoints CY127(D) and CY6, and near the E-cadherin
(CDH1) gene and the D16S496 marker.

All publications and patents are incorporated by reference herein, as
though individually incorporated by reference. The invention is not limited to
the exact details shown and described, for it should be understood that many
variations and modifications may be made while remaining within the spirit and
scope of the invention defined by the claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION 
(i) APPLICANT: Mayo Foundation for Medical Education and Research 



(ii) TITLE OF THE INVENTION: GENE ENCODING HYALURONAN SYNTHASE




(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Schwegman, Lundberg, Woessner & Kluth, P.A.

(B) STREET: P.O. Box 2938

(C) CITY: Minneapolis

(D) STATE: MN

(E) COUNTRY: U.S.A.

(F) ZIP: 55402

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ Version 2.0

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/812,008 

(B) FILING DATE: 05-MAR-1997 

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/812,008 

(B) FILING DATE: 05-MAR-1997 

(A) APPLICATION NUMBER: 08/675,499 

(B) FILING DATE : 03-JUL-1996 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Embretson, Janet E

(B) REGISTRATION NUMBER: 39,665

(C) REFERENCE/DOCKET NUMBER: 150.183US1

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-373-6900 

(B) TELEFAX: 612-339-3061 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2947 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ACATGTAAGA AGAAGGAGAA GTCAAGGCGT CTGGAAAGAA TTACCCAGTC CTGGCTTCGA 60 

GCAGCCCATT GAACGGGGGA CTTGAACCAG CCAAAGACTT CTTCATTCTG CTCTTGCTAG 120 

ACTCTGCTGA GTCTTGACCC GGCTTGTAGG TTGATGTGAA AAGAGATTTT GTGTCGTCGG 180

AGGGAAGGGG ATTGGAGCAA ATAGCAAAAC AGGGGGAAAA GTTAATTTAT CTTTAAAGCA 240

GATATAACAA AGAATTAGAA GACTTAAGTG CAGCGGAAAT ATAAAGAGAA TATTAGTGAA 300

ATTTCTTCTC AAAGAGGGGA GAACCAAGCA TTTAAGGCTC CCCCATCTTT TTTTTTAAAT 360 

GTTGTTTTTA AATTTCTTAT TTTTTTTGGC CGGTCGTCTC AAATTCATCT GATTTCTTAT 420 

TACCTCAATT TTGGAAACTT CCTTCCACGA CCCTCCGGGA CCACACAGAC AGGCGGAGGA 480 

CGAGTCTATG AGCAGGAGCT GAACAAGATG CATTGTGAGA GGTTTCTATG TGTCCTGAGA 540

ATAATTGGAA CTACACTTTT TGGAGTGTCT CTCCTCCTCG GAATCACAGC TGCTTATATT 600

GTTGGCTACC AGTTTATCCA AACAGATAAT TACTACTTCT CATTTGGACT GTACGGTGCC 660

TTTTTAGCCT CGCATCTCAT CATCCAAAGC CTCTTTGCCT TTTTGGAACA CCGGAAAATG 720

AAGAAGTCCC TTGAAACCCC GATTAAATTG AACAAAACGG TAGCACTCTG CATCGCTGCG 780

TACCAAGAGG ACCCTGACTA CTTACGGAAA TGTTTGCAAT CTGTGAAAAG GCTGACCTAC 840

CCTGGGATTA AAGTCGTGAT GGTCATCGAT GGGAACTCAG ACGACGACCT TTACATGATG 900 

GACATATTCA GCGAAGTTAT TGGCAGGGAC AAATCGGCCA CGTACATCTG GAAGAACAAC 960 

TTTCATGAAA AGGGACCTGG TGAGACAGAA GAGTCCCATA AAGAAAGTTC ACAACATGTC 1020 

ACCCAATTGG TCTTGTCTAA CAAAAGTATT TGCATCATGC AAAAATGGGG TGGAAAGAGA 1080 

GAAGTCATGT ACACAGCCTT CAGAGCACTG GGGCGAAGCG TGGATTATGT ACAGGTGTGT 1140 

GACTCAGATA CTATGCTTGA CCCTGCCTCA TCTGTGGAGA TGGTGAAGGT CTTAGAGGAA 1200

GACCCTATGG TTGGAGGTGT TGGAGGAGAT GTCCAGATTT TAAACAAGTA TGATTCCTGG 1260

ATCTCCTTCC TCAGCAGCGT GAGATACTGG ATGGCTTTTA ATATAGAAAG GGCCTGCCAG 1320

TCTTATTTTG GCTGTGTCCA GTGCATAAGC GGTCCTCTGG GAATGTACAG AAACTCCTTG 1380

CTGCATGAAT TTGTGGAAGA CTGGTACAAT CAGGAATTCA TGGGTAACCA ATGCAGTTTT 1440

GGTGACGACA GGCACCTTAC CAACAGGGTG TTGAGTCTGG GCTATGCAAC TAAATACACG 1500 

GCTCGGTCCA AGTGCCTTAC TGAAACTCCC ATAGAATATC TGAGATGGCT GAACCAGCAG 1560 

ACCCGATGGA GCAAGTCCTA CTTCCGAGAG TGGCTGTACA ATGCCATGTG GTTTCACAAG 1620 

CATCACCTGT GGATGACCTA TGAAGCTGTT ATCACTGGAT TCTTTCCTTT CTTTCTCATT 1680

GCCACAGTCA TCCAGCTCTT CTACAGGGGT AAAATCTGGA ACATCCTCCT CTTCCTGTTA 1740 

ACTGTCCAGC TAGTGGGTCT CATCAAGTCA TCTTTTGCCA GCTGCCTTAG AGGAAATATC 1800 

GTCATGGTAT TCATGTCTCT GTATTCAGTG TTATACATGT CAAGTCTACT TCCTGCCAAG 1860 

ATGTTTGCAA TTGCAACCAT AAACAAAGCT GGGTGGGGCA CATCTGGAAG GAAGACCATT 1920 

GTTGTTAATT TCATAGGACT TATTCCAGTG TCCGTGTGGT TTACAATCCT TCTAGGTGGT 1980 

GTAATTTTCA CCATTTATAA GGAATCTAAA AAGCCATTTT CCGAATCCAA ACAGACTGTT 2040

CTCATCGTGG GAACTTTGAT CTATGCATGC TACTGGGTCA TGCTTTTGAC TCTCTATGTG 2100

GTTCTCATCA ATAAGTGTGG CAGGCGGAAG AAGGGACAAC AGTATGACAT GGTGCTTGAT 2160

GTATGATGAT GTTTGTAGTC ACACCTGGAG ACACACACAC ACACACATCA CACACACACA 2220 

CACCTTAGCT CCTCAAGGGG CTATACAGTA TTGTGGCACC GCACCCTGCC ACCACAGGAG 2280 

ACATATCACT GCTGCTGGGA CTTGAACAAA GACATTCAAT GGGGGTTGGT TTCTTTTTTA 2340 

TTCTGCCAAA GCAAATTGAT ACATCAGTGA GAAGAAAGTC CGATTAAATC TGACAGTTTT 2400 

AGGACGGTGG GATGATGTCT TGGCTTATGC ACTTTTCCCT TACTGTGCAT CTGCCTGACA 2460 

GTGTTTGTTC TAAATACCTC ACTTGCCATG CTTTGTGTGG GTGATCATGG AAGAAAAGGA 2520 

TTCTGAAAAC TCAAGGGAAC GTTCTTTCAA CCTACACATC CTAACTTATG GACTCTTTTG 2580 

ATAGCTGATG ATTTTCTTTC TATTTTTTGT TTTTAAGGAA AATTGTTCAT CTTTACCAAA 2640

TGAAATGCCA AAGGAAAGTT GGAAAGCCAC TGGCTATGCT GTATTTTGAT ATAATAATTG 2700 

TACTGTGTTT TAAATTTTGT ATCCGGATTT TTAAAAACAA AATTTCACAC CATAGTCTAT 2760 

ATTTTACTTC TCTGGCAAAA TACACTTTTG TTCTTTTATA TATATATATA TATATATATA 2820

ATAAAATAGG TTCTAAAAAA ATCCATACTA TAAAAAAAAA TTAACCTGCC CAAAATGTGA 2880

AACGTGGTTG ACTGATGTTC ATGAAAGAAT AAAATGTTTC TCTCTTTCTC TACATTTTAA 2940

AAAAAAA 2947 
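
The nucleotide listing above can be cross-checked against the amino acid listing that follows by translating the coding region with the standard genetic code. The sketch below does this for 48 nucleotides only. The function name `translate` is hypothetical, and the assumption that the reading frame opens at the ATG at positions 508 to 510 is inferred from the initial Met of the protein listing, not stated in the sequence listing itself.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in reading frame 1 into one-letter amino acids."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 508-555 of the listing (assumed start of the coding region).
orf_start = "ATGCATTGT GAGAGGTTT CTATGTGTC CTGAGAATA ATTGGAACT ACA"
print(translate(orf_start))  # MHCERFLCVLRIIGTT
```

The printed string corresponds to residues 1 to 16 of the amino acid listing (Met His Cys Glu Arg Phe Leu Cys Val Leu Arg Ile Ile Gly Thr Thr).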

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 552 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



Met His Cys Glu Arg Phe Leu Cys Val Leu Arg Ile Ile Gly Thr Thr
1 5 10 15

Leu Phe Gly Val Ser Leu Leu Leu Gly Ile Thr Ala Ala Tyr Ile Val
20 25 30

Gly Tyr Gln Phe Ile Gln Thr Asp Asn Tyr Tyr Phe Ser Phe Gly Leu
35 40 45

Tyr Gly Ala Phe Leu Ala Ser His Leu Ile Ile Gln Ser Leu Phe Ala
50 55 60

Phe Leu Glu His Arg Lys Met Lys Lys Ser Leu Glu Thr Pro Ile Lys
65 70 75 80

Leu Asn Lys Thr Val Ala Leu Cys Ile Ala Ala Tyr Gln Glu Asp Pro
85 90 95

Asp Tyr Leu Arg Lys Cys Leu Gln Ser Val Lys Arg Leu Thr Tyr Pro
100 105 110

Gly Ile Lys Val Val Met Val Ile Asp Gly Asn Ser Asp Asp Asp Leu
115 120 125

Tyr Met Met Asp Ile Phe Ser Glu Val Ile Gly Arg Asp Lys Ser Ala
130 135 140

Thr Tyr Ile Trp Lys Asn Asn Phe His Glu Lys Gly Pro Gly Glu Thr
145 150 155 160

Glu Glu Ser His Lys Glu Ser Ser Gln His Val Thr Gln Leu Val Leu
165 170 175

Ser Asn Lys Ser Ile Cys Ile Met Gln Lys Trp Gly Gly Lys Arg Glu
180 185 190

Val Met Tyr Thr Ala Phe Arg Ala Leu Gly Arg Ser Val Asp Tyr Val
195 200 205

Gln Val Cys Asp Ser Asp Thr Met Leu Asp Pro Ala Ser Ser Val Glu
210 215 220

Met Val Lys Val Leu Glu Glu Asp Pro Met Val Gly Gly Val Gly Gly
225 230 235 240

Asp Val Gln Ile Leu Asn Lys Tyr Asp Ser Trp Ile Ser Phe Leu Ser
245 250 255

Ser Val Arg Tyr Trp Met Ala Phe Asn Ile Glu Arg Ala Cys Gln Ser
260 265 270

Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr Arg
275 280 285

Asn Ser Leu Leu His Glu Phe Val Glu Asp Trp Tyr Asn Gln Glu Phe
290 295 300

Met Gly Asn Gln Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg
305 310 315 320

Val Leu Ser Leu Gly Tyr Ala Thr Lys Tyr Thr Ala Arg Ser Lys Cys
325 330 335

Leu Thr Glu Thr Pro Ile Glu Tyr Leu Arg Trp Leu Asn Gln Gln Thr
340 345 350

Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Met Trp
355 360 365

Phe His Lys His His Leu Trp Met Thr Tyr Glu Ala Val Ile Thr Gly
370 375 380

Phe Phe Pro Phe Phe Leu Ile Ala Thr Val Ile Gln Leu Phe Tyr Arg
385 390 395 400

Gly Lys Ile Trp Asn Ile Leu Leu Phe Leu Leu Thr Val Gln Leu Val
405 410 415

Gly Leu Ile Lys Ser Ser Phe Ala Ser Cys Leu Arg Gly Asn Ile Val
420 425 430

Met Val Phe Met Ser Leu Tyr Ser Val Leu Tyr Met Ser Ser Leu Leu
435 440 445

Pro Ala Lys Met Phe Ala Ile Ala Thr Ile Asn Lys Ala Gly Trp Gly
450 455 460

Thr Ser Gly Arg Lys Thr Ile Val Val Asn Phe Ile Gly Leu Ile Pro
465 470 475 480

Val Ser Val Trp Phe Thr Ile Leu Leu Gly Gly Val Ile Phe Thr Ile
485 490 495

Tyr Lys Glu Ser Lys Lys Pro Phe Ser Glu Ser Lys Gln Thr Val Leu
500 505 510

Ile Val Gly Thr Leu Ile Tyr Ala Cys Tyr Trp Val Met Leu Leu Thr
515 520 525

Leu Tyr Val Val Leu Ile Asn Lys Cys Gly Arg Arg Lys Lys Gly Gln
530 535 540

Gln Tyr Asp Met Val Leu Asp Val
545 550

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 583 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Arg Gln Asp Met Pro Lys Pro Ser Glu Ala Ala Arg Cys Cys Ser
1 5 10 15

Gly Leu Ala Arg Arg Ala Leu Thr Ile Ile Phe Ala Leu Leu Ile Leu
20 25 30

Gly Leu Met Thr Trp Ala Tyr Ala Ala Gly Val Pro Leu Ala Ser Asp
35 40 45

Arg Tyr Gly Leu Leu Ala Phe Gly Leu Tyr Gly Ala Phe Leu Ser Ala
50 55 60

His Leu Val Ala Gln Ser Leu Phe Ala Tyr Leu Glu His Arg Arg Val
65 70 75 80

Ala Ala Ala Ala Arg Arg Ser Leu Ala Lys Gly Pro Leu Asp Ala Ala
85 90 95

Thr Ala Arg Ser Val Ala Leu Thr Ile Ser Ala Tyr Gln Glu Asp Pro
100 105 110

Ala Tyr Leu Arg Gln Cys Leu Thr Ser Ala Arg Ala Leu Leu Tyr Pro
115 120 125

His Thr Arg Leu Arg Val Leu Met Val Val Asp Gly Asn Arg Ala Glu
130 135 140

Asp Leu Tyr Met Val Asp Met Phe Arg Glu Val Phe Ala Asp Glu Asp
145 150 155 160

Pro Ala Thr Tyr Val Trp Asp Gly Asn Tyr His Gln Pro Trp Glu Pro
165 170 175










180 185 190

Ala Glu Asp Pro Gly Arg Leu Ala Val Glu Ala Leu Val Arg Thr Arg
195 200 205

Arg Cys Val Cys Val Ala Gln Arg Trp Gly Gly Lys Arg Glu Val Met
210 215 220

Tyr Thr Ala Phe Lys Ala Leu Gly Asp Ser Val Asp Tyr Val Gln Val
225 230 235 240

Cys Asp Ser Asp Thr Arg Leu Asp Pro Met Ala Leu Leu Glu Leu Val
245 250 255

Arg Val Leu Asp Glu Asp Pro Arg Val Gly Ala Val Gly Gly Asp Val
260 265 270

Arg Ile Leu Asn Pro Leu Asp Ser Trp Val Ser Phe Leu Ser Ser Leu
275 280 285

Arg Tyr Trp Val Ala Phe Asn Val Glu Arg Ala Cys Gln Ser Tyr Phe
290 295 300

His Cys Val Ser Cys Ile Ser Gly Pro Leu Gly Leu Tyr Arg Asn Asn
305 310 315 320

Leu Leu Gln Gln Phe Leu Glu Ala Trp Tyr Asn Gln Lys Phe Leu Gly
325 330 335

Thr His Cys Thr Phe Gly Asp Asp Arg His Leu Thr Asn Arg Met Leu
340 345 350

Ser Met Gly Tyr Ala Thr Lys Tyr Thr Ser Arg Ser Arg Cys Tyr Ser
355 360 365

Glu Thr Pro Ser Ser Phe Leu Arg Trp Leu Ser Gln Gln Thr Arg Trp
370 375 380

Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Leu Trp Trp His
385 390 395 400

Arg His His Ala Trp Met Thr Tyr Glu Ala Val Val Ser Gly Leu Phe
405 410 415

Pro Phe Phe Val Ala Ala Thr Val Leu Arg Leu Phe Tyr Ala Gly Arg
420 425 430

Pro Trp Ala Leu Leu Trp Val Leu Leu Cys Val Gln Gly Val Ala Leu
435 440 445

Ala Lys Ala Ala Phe Ala Ala Trp Leu Arg Gly Cys Val Arg Met Val
450 455 460

Leu Leu Ser Leu Tyr Ala Pro Leu Tyr Met Cys Gly Leu Leu Pro Ala
465 470 475 480

Lys Phe Leu Ala Leu Val Thr Met Asn Gln Ser Gly Trp Gly Thr Ser
485 490 495

Gly Arg Lys Lys Leu Ala Ala Asn Tyr Val Pro Val Leu Pro Leu Ala
500 505 510

Leu Trp Ala Leu Leu Leu Leu Gly Gly Leu Ala Arg Ser Val Ala Gln
515 520 525

Glu Ala Arg Ala Asp Trp Ser Gly Pro Ser Arg Ala Ala Glu Ala Tyr
530 535 540

His Leu Ala Ala Gly Ala Gly Ala Tyr Val Ala Tyr Trp Val Val Met
545 550 555 560

Leu Thr Ile Tyr Trp Val Gly Val Arg Arg Leu Cys Arg Arg Arg Ser
565 570 575

Gly Gly Tyr Arg Val Gln Val
580



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:



Met Lys Glu Lys Ala Ala Glu Thr Met Glu Ile Pro Glu Gly Ile Pro
1 5 10 15

Lys Asp Leu Glu Pro Lys His Pro Thr Leu Trp Arg Ile Ile Tyr Tyr
20 25 30

Ser Phe Gly Val Val Leu Leu Ala Thr Ile Thr Ala Ala Tyr Val Ala
35 40 45

Glu Phe Gln Val Leu Lys His Glu Ala Ile Leu Phe Ser Leu Gly Leu
50 55 60

Tyr Gly Leu Ala Met Leu Leu His Leu Met Met Gln Ser Leu Phe Ala
65 70 75 80

Phe Leu Glu Ile Arg Arg Val Asn Lys Ser Glu Leu Pro Cys Ser Phe
85 90 95

Lys Lys Thr Val Ala Leu Thr Ile Ala Gly Tyr Gln Glu Asn Pro Glu
100 105 110

Tyr Leu Ile Lys Cys Leu Glu Ser Cys Lys Tyr Val Lys Tyr Pro Lys
115 120 125

Asp Lys Leu Lys Ile Ile Leu Val Ile Asp Gly Asn Thr Glu Asp Asp
130 135 140

Ala Tyr Met Met Glu Met Phe Lys Asp Val Phe His Gly Glu Asp Val
145 150 155 160

Gly Thr Tyr Val Trp Lys Gly Asn Tyr His Thr Val Lys Lys Pro Glu
165 170 175

Glu Thr Asn Lys Gly Ser Cys Pro Glu Val Ser Lys Pro Leu Asn Glu
180 185 190

Asp Glu Gly Ile Asn Met Val Glu Glu Leu Val Arg Asn Lys Arg Cys
195 200 205

Val Cys Ile Met Gln Gln Trp Gly Lys Arg Glu Val Met Tyr Thr Ala
210 215 220

Phe Gln Ala Ile Gly Thr Ser Val Asp Tyr Val Gln Val Cys Asp Ser
225 230 235 240

Asp Thr Lys Leu Asp Glu Leu Ala Thr Val Glu Met Val Lys Val Leu
245 250 255

Glu Ser Asn Asp Met Tyr Gly Ala Val Gly Gly Asp Val Arg Ile Leu
260 265 270

Asn Pro Tyr Asp Ser Phe Ile Ser Phe Met Ser Ser Leu Arg Tyr Trp
275 280 285

Met Ala Phe Asn Val Glu Arg Ala Cys Gln Ser Tyr Phe Asp Cys Val
290 295 300

Ser Cys Ile Ser Gly Pro Leu Gly Met Tyr Arg Asn Asn Ile Leu Gln
305 310 315 320

Val Phe Leu Glu Ala Trp Tyr Arg Gln Lys Phe Leu Gly Thr Tyr Cys
325 330 335

Thr Leu Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser Met Gly
340 345 350



Ser Leu Tyr Leu Arg Trp Leu Asn Gln Gln Thr Arg Trp Thr Lys Ser
370 375 380

Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Gln Trp Trp His Lys His His
385 390 395 400

Ile Trp Met Thr Tyr Glu Ser Val Val Ser Phe Ile Phe Pro Phe Phe
405 410 415

Ile Thr Ala Thr Val Ile Arg Leu Ile Tyr Ala Gly Thr Ile Trp Asn
420 425 430

Val Val Trp Leu Leu Leu Cys Ile Gln Ile Met Ser Leu Phe Lys Ser
435 440 445

Ile Tyr Ala Cys Trp Leu Arg Gly Asn Phe Ile Met Leu Leu Met Ser
450 455 460

Leu Tyr Ser Met Leu Tyr Met Thr Gly Leu Leu Pro Ser Lys Tyr Phe
465 470 475 480

Ala Leu Leu Thr Leu Asn Lys Thr Gly Trp Gly Thr Ser Gly Arg Lys
485 490 495

Lys Ile Val Gly Asn Tyr Met Pro Ile Leu Pro Leu Ser Ile Trp Ala
500 505 510

Ala Val Leu Cys Gly Gly Val Gly Tyr Ser Ile Tyr Met Asp Cys Gln
515 520 525

Asn Asp Trp Ser Thr Pro Glu Lys Gln Lys Glu Met Tyr His Leu Leu
530 535 540

Tyr Gly Cys Val Gly Tyr Val Met Tyr Met Val Ile Met Ala Val Met
545 550 555 560

Tyr Trp Val Trp Val Lys Arg Cys Cys Arg Lys Arg Ser Gln Thr Val
565 570 575

Thr Leu Val His Asp Ile Pro Asp Met Cys Val
580 585



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 419 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Met Pro Ile Phe Lys Lys Thr Leu Ile Val Leu Ser Phe Ile Phe Leu
1 5 10 15

Ile Ser Ile Leu Ile Tyr Leu Asn Met Tyr Leu Phe Gly Thr Ser Thr
20 25 30

Val Gly Ile Tyr Gly Val Ile Leu Ile Thr Tyr Leu Val Ile Lys Leu
35 40 45

Gly Leu Ser Phe Leu Tyr Glu Pro Phe Lys Gly Asn Pro His Asp Tyr
50 55 60

Lys Val Ala Ala Val Ile Pro Ser Tyr Asn Glu Asp Ala Glu Ser Leu
65 70 75 80

Leu Glu Thr Leu Lys Ser Val Leu Ala Gln Thr Tyr Pro Leu Ser Glu
85 90 95

Ile Tyr Ile Val Asp Asp Gly Ser Ser Asn Thr Asp Ala Ile Gln Leu
100 105 110

Ile Glu Glu Tyr Val Asn Arg Glu Val Asp Ile Cys Arg Asn Val Ile
115 120 125

Val His Arg Ser Leu Val Asn Lys Gly Lys Arg His Ala Gln Ala Trp
130 135 140

Ala Phe Glu Arg Ser Asp Ala Asp Val Phe Leu Thr Val Asp Ser Asp
145 150 155 160

Thr Tyr Ile Tyr Pro Asn Ala Leu Glu Glu Leu Leu Lys Ser Phe Asn
165 170 175

Asp Glu Thr Val Tyr Ala Ala Thr Gly His Leu Asn Ala Arg Asn Arg
180 185 190

Gln Thr Asn Leu Leu Thr Arg Leu Thr Asp Ile Arg Tyr Asp Asn Ala
195 200 205

Phe Gly Val Glu Arg Ala Ala Gln Ser Leu Thr Gly Asn Ile Leu Val
210 215 220

Cys Ser Gly Pro Leu Ser Ile Tyr Arg Arg Glu Val Ile Ile Pro Asn
225 230 235 240

Leu Glu Arg Tyr Lys Asn Gln Thr Phe Leu Gly Leu Pro Val Ser Ile
245 250 255

Gly Asp Asp Arg Cys Leu Thr Asn Tyr Ala Ile Asp Leu Gly Arg Thr
260 265 270

Val Tyr Gln Ser Thr Ala Arg Cys Asp Thr Asp Val Pro Phe Gln Leu
275 280 285

Lys Ser Tyr Leu Lys Gln Gln Asn Arg Trp Asn Lys Ser Phe Phe Arg
290 295 300

Glu Ser Ile Ile Ser Val Lys Lys Ile Leu Ser Asn Pro Ile Val Ala
305 310 315 320

Leu Trp Thr Ile Phe Glu Val Val Met Phe Met Met Leu Ile Val Ala
325 330 335

Ile Gly Asn Leu Leu Phe Asn Gln Ala Ile Gln Leu Asp Leu Ile Lys
340 345 350

Leu Phe Ala Phe Leu Ser Ile Ile Phe Ile Val Ala Leu Cys Arg Asn
355 360 365

Val His Tyr Met Val Lys His Pro Ala Ser Phe Leu Leu Ser Pro Leu
370 375 380

Tyr Gly Ile Leu His Leu Phe Val Leu Gln Pro Leu Lys Leu Tyr Ser
385 390 395 400

Leu Cys Thr Ile Lys Asn Thr Glu Trp Gly Thr Arg Lys Lys Val Thr
405 410 415

Ile Phe Lys
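
The amino acid listings use three-letter residue codes, whereas the alignment figures use one-letter codes. A minimal sketch of the conversion is given below so that a listing row can be compared with a figure row. The names `THREE_TO_ONE` and `to_one_letter` are hypothetical, and the input string is the first 16 residues of the listing above, typed out by hand.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string.

    An unrecognised code raises KeyError, which also flags misread residues.
    """
    return "".join(THREE_TO_ONE[r.capitalize()] for r in residues.split())


first_row = "Met Pro Ile Phe Lys Lys Thr Leu Ile Val Leu Ser Phe Ile Phe Leu"
print(to_one_letter(first_row))  # MPIFKKTLIVLSFIFL
```

The result can be read against the start of the HasA row in the alignment figures.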



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 426 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:




Trp Thr 
Val Ala 



Ile Leu
340 
Leu His 



Ala Arg 



Arg Cys 
350 
Phe Val 



Leu His
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



1. Claims: 1-7, 16, 18, 20, 21, 24, 28, 29, 32, 34, 36, 38, 

42, 44 and 45, and partially, 26, 27, 40 and 41 



DNA molecule encoding hyaluronan synthase-2, expression 
cassette and host cell comprising the same, uses thereof for 
producing hyaluronan synthase-2 and for altering the amount 
of hyaluronan produced by a cell; hyaluronan synthase-2 
polypeptide, use thereof for treating a condition associated 
with alteration in hyaluronan synthesis or extracellular 
accumulation; method for identifying a mammal affected by
aberrant hyaluronan synthesis or extracellular accumulation 
by using a mammalian hyaluronan synthase-2 binding agent or 
an hyaluronan synthase-2-specific oligonucleotide; method
for detecting hyaluronan synthase-2 DNA; use of an agent 
effective to alter native hyaluronan synthase-2 activity in 
a therapeutic method; method to prepare hyaluronan by using 
hyaluronan synthase-2. 



2. Claims: 8-15, 17, 19, 22, 23, 25, 30, 31, 33, 35, 37, 39, 
43, 46 and 47, and partially 26, 27, 40 and 41 



DNA molecule encoding hyaluronan synthase-3, primer or probe 
thereof, expression cassette and host cell comprising such 
DNA molecule, uses thereof for producing hyaluronan 
synthase-3 and for altering the amount of hyaluronan 
produced by a cell; hyaluronan synthase-3 polypeptide, use 
thereof for treating a condition associated with alteration 
in hyaluronan synthesis or extracellular accumulation; 
method for identifying a mammal affected by aberrant 
hyaluronan synthesis or extracellular accumulation by using 
a mammalian hyaluronan synthase-3 binding agent or an
hyaluronan synthase-3-specific oligonucleotide; method for 
detecting hyaluronan synthase-3 DNA; use of an agent 
effective to alter native hyaluronan synthase-3 activity in 
a therapeutic method; method to prepare hyaluronan by using 
hyaluronan synthase-3.
MHas 3 - -MPVQLTTALR- WGTSL FALWLCG I LAAYVTGYQFI HTEKHYLS FGLYGAI L

MHas2 -MHCERFLCVLR- 1 IGTTL FGVS LLLG X TAAY I VGYQFIQTOMYYFSFGLYOAF'L 

MHasl MRQDMPKPSEAARCCSGtARRALTI I FALL I LGI24TWAYAAGVP IASDRYGIJLAPcn«YGAFL 

DG4 2 MKEKAAETME I PEGIPKDLE PKH PTLWRI I YYS FGWLLAT I TAAYVAE FQVLKH EAI LFSLGLYGLAM 

Ha* A MPIFKKTLIVLSFIFLISILXYLKMYLFGTST VGI-YGVILITYLVIKL GLSF 

* * # * * * 

MKas3 GLHLL IQSLFAFLEHRRM RRA-GRPUOJiCSQRSRSVAIjCIAAYQEDPEYIJRRC^ 

MHas 2 ASHLI IQSLFAFLEHRKM KXSLETPIKLN KTVALC IAAYQEDPD YLRKCLQSVWRLTYPG 

MHasl S AHLVAQSLFAYLEHRRVAAAARRS LAKGPLDAATA — RSVALTISAYQEDPAYLRQCLTSARALLYPH 

OG42 IXHLMMQSLFAFLE IRRV NKSEL-PCSFK KTVALTIAGYQENPEYL IKCLESCKYVKYPK 

Ha sA LYE PFKGNPHDY K-VAAVI PSYHEDAE SLLETLKSVLAQTYPL 

• ** * • * # * 

MHaa3 LK — VVMVVDGNRQMTYMIJDIFHEVtXSOTEQAGF EASLQEGMERV 

MHas 2 IK — VVhfVIOGKSODOLYMMDIFSEVMGRDK-SATYXWKNHFSE-KSPGET EESHKESSQHV 

MHasl TR1AVLMVVDGNRAEDLYMVDMFREVFADED- PATYVWDGNYHQPWE PAEATGAVGE GAYREVEAEDPG 
DG42 DlOiCI XLVXOCaiTCOOATt^KFKDVFRGED-VGTYVITKGm 

HasA S — E IYXVDOGSSKTDAIQL IEEYVNRE VDICRNVIVHR 

* # # *#* * ** « 

MHas3 HA WWASTFSCI>^KWGGKREVMYTAFKAU^SVDYIQV^ 

MHas 2 TQ- LVLSNKS XCXM&KWGGKREVMYTAFRAX«GRSVDYVQV^ 

MHas 1 RIAV^ALVRTRRCVCVAQRHGGKREVMYTAFKA^ 

DG4 2 I NMVE E LVRNKRCVC IMQQWGGKREVMYTAFQAI GTSVDYVQVCD SDTKLDE LATVEMVKVI*ESNDMYG 

HasA S LVKK G -KRHAQAWXFTRS DADV- FLTV-DSDTY I YPNALEELLKSFNDE TVYA 

ft * * «* * *** * 

MHas3 GVGCTOVQILHKYDSWISFLSSVRYWM^FKVERACQSYFGCVQCI 

MHas 2 GVGGOVQILNKYDStTCSFLSSVKYHMAFNIER^^ 

MHas 1 AVI(roVRILNPII>SWVSFTSSIJlYWAFNW 

DG4 2 AVGGDVRI LWPYOS FTS FMS S LRYWMAFNVEKACQSYFOCVSC I SGPLGMYRNN X LQVFIXAWYRQKFL 

HasA ATG-HLNARNRQTNIXTRLTD IRYDNAFGVERAAQSLTGNI LVCSGPLS XYRREVX I PNLERYKNQTFL 
* * * . ** ** * * * * ** * * 

MHas 3 GSKCSFGDDRHLTHKVI^lX^fRTKYTARSKCLre 

MHas2 GNQCSFGDDRHLTNRVI^IXrrATKYTARSKCL^ 

MHas 1 GTHCTFGDDRHLTKRMI^SMGYATKYYSRSRCYSETPSS FXRWI^QQTTOySKH ^ ir'KEWLYNALWWHRflHA 

DG4 2 GTYCTI^ORHLTNRVI^OnfRTKYTHKSRAre^ 

HasA GLPVSIGDDRCLTNYAIDLG-RTVYQSTARCDTDVPFQI^ 

* ** * * ** * * * 

MHa s 3 W MTYESW TGFFPFFLIATVIQLFYRGRI - -WNILLFLLTVQLVGI IKATYACFLRG 

MHas 2 W MTYEAVI TGFFPFFX*IATVXQLTYRflKI — WNIIXFIXTWQLVCLIJCBS FASCLRG 

MHasl W MTYEAW SGLFPFFVAATVLRLFTAGRP — KMJLH\OXC^^^AlJVICXAFAM9XTC 

DG42 W MTYESW SFIFPFFITATVXRLIYAGTI — WNVVWIXLCIQIMSLFJCSIYACWLRG 

HasA VAUrriFEWMFWMLIVAIGmXFlK^ IFIVALC R-NVHYMVKH 

* ** * * ft* * * * * * *• 

MHas 3 NAEMX FMSLYSLLYMS SXXPAKI FAIATIKKSGWGTSGRKT IWNFIGLI PVS I WVAVI-LGGIAYTAY - 

MHas2 NI VMVTMSLYSVX»YMSSIJmPAKMFAIATIHKAGTOTSGRKT XWHFXGLXPVSWFTXXJJGGVXFTXYK 

MHasl CVT?KVIJ^LYAPLYfCGIXPAKFIAL^ 

OG4 2 MF XMLXMSLY6MLYMTGIXPSKYFALLTU«TGTCTSGR I LPLS IKAAVLCGGVGYS I YM 

HasA PASFIJLSPLYGILHLFVI^PIJCLYfil/^IKKTEWaTRKKVTIFK* 



MHas 3 -CQDLFSET E LAFLVSGAI LYGCYWVALLMLYLAI IARRJCG - - KKPEQYS LAFAEV • 

MHas 2 E SKKPFSES - KQ TVTjIVGTLIYACYKVM1XTLYWLI~NKCGRRKKGQQYDMV-LOV» 

MHasl EARADK8GPSRAA£AYKLAAjCSAGAXVAYKWMLT I YWVGVRRLC- RRRSGG- YRVQV • 

OG4 2 DCQtTOWSTPEKQKEMYHIXYGCVGYVMYWIMAVMYVMfVKR 



FIG. 14A 

SUBSTITUTE SHEET (RULE 26) 



WO 98/00551 



14/18 



PCT/US97/11761 




FIG. 13 




1 'atcccggtgcagctgactacagccctgcgtgtggtgggcaccagtctgtttgccctggtagtgcto 

mpvqlttalrvvgtslfalvvl 22 
67 ggaggcatcctggcggcctatgtgacaggctaccagtttatccacacaga^^ 

GGILAAYVTGYQFIHTEKHYI-S 44 
133 TTTGGCCTCTACGGTGCCATCCTGGGTCTAC^ 

FGLYGAILGLHLLIQSLFAFLE 66 
199 CACCGTCGAATGCGCAGGGCAGGGCGCCCCCTCA^ 

HRRMRRAGRPL KLHCSOpftR S V B8 
265 GCACTCTG CATTGCTG CCTACC AAGAGGACCCCGAATACCTGCGCAAGTGCCTTCG CTCAGCTCAG 

ALCIAAYQEDPEYL RKCLRSAO 110 
331 CGCATTGCCTTTCCAAACCTCAAGGTGGTCATGGTAGTGGATGGCAA 

EIAFPNLKVVMVVDGNRQEDTY 132 
3 97 ATGTTGGACATCTTCCATGAGGTGCTGGGTGGCAC^ 

MLDIFHEVLGGTEQAGF FVWRS 154 
463 AATTTCCATGAGGCGGGTGAAGGAGAGACAGAGGCCAGCCTGCAGGAAGGCATGGAGCGTGTGCGA 

NFHEAGEGETEASLQEGMER VR 176 
529 GCTGTGGTGTGGGCCAGCACCTTCTCATGCATCATGCAGAAG 

AVV^WASTFSCTMQK W*G G K R E V M 198 
595 TACACTGCCTTCAAGGCCCTTGGCAACTO 

YTAFKALGNSVDYIQVCDSDTV 220 
661 CTGGACCCAGCCTGCACCATTGAGATGCTTCGAGTCTTGGAAG 

LDPAC*TIE. MLRVLEEDPQVGGV 242 

727 ggaggagatgtccaAtcctcaacaagtatgattcatggatctccttcctgagc^ 

ggdvqilkkydswisflssvry 264 

793 tggatggctttcaacgtggagcgggcctgccagtcctacttt^ 

wmafkveracqsyfgcvqcisg 286 

859 cctttgggcatgtaccgcaacagcctccttcagcagttcctgga 

PLGMYRMSLL-QQFLEDWYHQKF 308 
925 CTAGGCAGCAAGTGCAGCTTTGGGGATGATCGGCACCTTACCAACCGAG 

L GSKCSFGDDRHLTNRVLSLGY 330 
991 CGGACTAAGTATAGAGCACGCTCTAAGTGCCTCACAGAGA 

RTKVTARSK CLTETPTRYLRWL 352 
1057 AATCAGCAAACCCGCTGGAGCAAGTCTTACTTTCGGGAATC 

NQQTRWSKSYFREWLYNSLWFH 374 
112 3 AAGCACCACCTCTGGATGACCTATGAATCAGTGGT^ 

KHHLWMTYESVVTGFFPFFLIA 396 
1189 ACAGTCATACA ACTTTTCTACCGTGGCC^^ 

TVIQLFYRGRIWNILLFLLTVQ 418 
1255 CTGGTGGGCATTATCAAGGCTACCTATGC^ 

h V G I I KATYACFTjR GNAEMIFM 440 
1321 TCCCTCTACTCCCTTCTCTATATGTCCAGCCTCTTGCCAGCCAA^ 

SLYStiI*YMSSLLPAKIFAIATI 462 
1387 AACAAGTCTGGCTGGGGCACTTCTGGCAGGAAAACCATTGT 

N K5GWGT5QR KTIVVNFIGLIP 484 
1453 GTGTCCATCTGGGTGGOSGTTCTT 

VSIWVAVLLGGLAYTAY CQDLF 506 
1519 AGTG AG ACCG A£CT AGCCTTCCTAGTCTCTGGGGCCA CTGCTACTGGGTGGCCCTC 

S ETELAFLVSGAI LYGC YW.VAL 528 
1585 CTCATGCTGTATCTGGCOATTATT^ 

LMLYLAI IARRCGKKP EQYSLA 550 
1651 TTTGCGGAGGTGTGA 

F A E V 554
1 GTOCTACTTT GGCTGTCTGC AGTGTATTAG TOGQCCC1TC GGCATCTACC

51 GCAACAGCCT CCTCCAGCAG TTCCTQGAGG ACTGGTACCA TCAGAAGTTC 

101 CTAGGCAGCA AGTGCAGCTT CGGGGATGAC CGGCACCTCA CCAACCGAGT 

151 CCTGAGCCTT GGCTACQGAA CTAAGTATAC CX3CQCGCTCC AAGTCCCTCA 

201 CAGAGACCCC CACTAAGTAC CTCCGGTCGC TCAAC 

FIG. 11A 



1 GTOCTACTTT GGCTGTGTOC AATGTATTAG TOQGCCTTTG GGCATCTACC 

51 GCAACAGCCT CCTTCAGCAG TTCCTQGAGG ATTOGTAOCA TCAGAAGTTC 

101 CTAGGCAGCA AGTGCAGCTT TQQGGATGAT CQGCACCTTA CCAACCGAGT 

151 CCTGAGTCTT GGCTACCGGA CTAAGTATAC AGCACGCTCT AAGTGCCTCA 

201 CAGAGACCCC CACTAGGTAC CTTOGATQGC TCAAT 

FIG. 11B



MHas3 1 GTCCTOCTTTGGCTGTOTQCAAT^ 50 

II Ml I I M II ) HI I 111 i I M I I I I I f I I I I M III II 1 1 1 1 I 1 1 1 
HHAS3 1 GTtXTTACTTTOGCICTCT 50 

• • * « • 

51 OCMCAGCCrrCCTI^^ 100 

MMMIMMII lllli Kill II I INI I f I i 1 1 1 1 1 1 M 1 1 1 1 1 1 
51 GCAACAGCCTCCIXX^GC^^ 100 

* • • * 

101 CTAGGCftGCAAGTQCAGCITO 150 

1 1 1 1 1 1 1 1 1 f I f 1 1 1 1 1 1 f f Ml II in II (if in 1 1 ii ill mi 

101 CTAGQCAGCAAGTGa^GCTirc^ 150 

• • * • • 
151 a^TGAGTCTXXCTAOC^ 200 

iiiiii 1 1 1 1 1 1 1 1 1 1 1 i i i 1 1 1 1 1 1 1 1 ii mil Milium 

151 CXrrcaiXCTTGGCTACOGAA 200 

201 CAGAGACCCCCACI7U3GTATC 235 

IIMIMM III II I mill II Mllllll 
201 CAGAGAOXCCACTAAGTAOT 235 

FIG. 11C 



HHAS3 1 SYPGCVQCISGPDGMCTNSLl^ 50 

1 1 1 1 1 1 1 1 II I II II I MM III Ml IIMIII MMM I II IIIIII 1 1 
MHas3 1 SYFGCVQCISGPDGMYKNSU^ 50 

• * 
51 L^DGYRTKyTARSKCLTETPTKYIi?WI^ 78 

MM I Mill MM M IMMM I M II 
51 LSLGYOTTOTARSKCLTETPTRYL^^ 78 

FIG. 11D
1 GTCTTATTTT GGGTGTCTTC AGTGCATTAG TOGACCTCTC GGAATCTACA
51 GAAACTCCTT GTTGCATGAG TITGTGGAAG ATTOGTACAA TCAAGAATTT 
101 ATOGGCAACC AATGTAGCTT TGGTGATGAC AGGCATCTCA CGAACCGGGT 
151 GCTCAGCCTC GGCTATCCAA CAAAATACAC AGCTCGATCT AAGTGCCITA 

TAAAC 




FIG. 1 



HHAS2 1 GTCTTATITrcGGTGTC^ 31 

MllllflMM Mill NIIIIJI If 
MHas2 1301 atatagaaagggcctgccagtcttattttggctgtgtccagtgcataagc 1350 

32 GGAiCXTICKXXaAATGTACAGAAACTC 81 
II I M Mill I III id H iff i If MM llillll MIM1MIM 
1351 ggtcctctgggaatgtacagaaactccttgctgcatgaatttgtggaaga 1400 

82 TTOCTACAATCAAGAATTT^^ 131 
Illllllllll Mill Mill IMMIII 1 1 llllllll I I II 
1401 ctggtacaatcaggaattcatgggtaaccaatgcagttttggtgacgaca 1450 

132 QGC&TCTCJUXPJtfX^ 181 
Nil II II III Mill 1 1 1 1 M1IIIMMIIII lllllill 
1451 ggcaccttaccaacagggtgttgagtctgggctatgcaactaaatacacg 1500 

182 GCTCGATCTAAGTGCCTTACTGAAACACCT 231 
Mill M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II Ml Ml Mill IMMIII 
1501 gctcggtccaagtgccttactgaaactcccatagaatatctgagatggct 1550 

232 AAAC ! \ \ \ \ 235 

Ml 

1551 gaaccagcagacccgatggagcaagtcctacttccgagagtggctgtaca 1600 



FIG. 10B 



HHAS2 1 •••SYPKVQCISGPlJGMraNSUiffiFVEI^ 29 

I M II 1 1 1 II M M II I II II I M M I M 
MHOS2 251 VTCSFLSSVRYWMAFNIERAC^^ 300 

30 NQEFMSNQCSFGDDRHLT^^ . 78 

I 1 1 M I M M M 1 1 M M I M 1 1 M 1 1 M II I M M I M M M I M 1 1 1 
301 WEFMZWZCSFGDDBHL^^ 350
MHas2 -MHCERFLCVLR- 1 IGTTI* FGVSLLLG I TAVY IVGY QF IQTDKYYFSFGLYQAFL

MHasl MRQOMPKPS EAARCCS GIAJWJLT 1 1 FAIXIZX3UaHAyAAGN^I^DRYGIXAPC2LYaAFL 

DG42 MKE KAAETME I PEG Z PKDLE PKH PTLWRI X YYS FGWLtAT ITAXYVAE F\JVLKHEAI LFSLGLYGLAM 

HaftA MPIFTOCTLIVLSFIFLISILIYLNMYLEX5TST VGI-YGVILITYLVXKL QLSF 

NodC MYIXOTTSTAAIS I - YA1XLTAYRSMQVLYARPIDGIAV 



MHas2 ASHLI IQSLFAFLEHRKM KKSLETPIKLMKT VALCIAAYQEDPDYLRKCLQSVKRLTYPGIK 

MHasl S AHLVAQS LFAYLE HRRVAAAARRS LAKGP LDAATARSVALT I SAYQEDPAYLRQCLTSARALLYPHTR 

DG4 2 LLH1MMQSLFAFLE IRRV MKSEL-PCSFKKT VALTXAGYQEOTEYLIKCLESClOn/KYPKDK 

HasA LYEPFKGNPHDY KVAAVT PSYHK DA ESTiT lETLKSVLAQTYPLS - 

-AVDVIVPSFKEDPGILSAC1ASIADQDYPGE' 




MUaa2 

MHa&l 

DG42 

HasA 

HodC 



— WMVIDGHSDODI 
IJWUf;VDGNRAEDLYMVI*4FR^^ 
UCI XLVXDGNTEDDAY164EMFKDVFHGEDVG1!YVWK^ 

-EIYIVDDGSSNTQA2QL IEEYVHRE VDICRNVIVHRS — 

LRVYWDOGSRHREAIVR VRAFYSBD PRFSFILLPE 



MHa«2 LVZ£NKSICXMQKHGGKR£VMYTAFRAX£RSVDYV(^ 

MHasl VEALVRTRRCVCVAQRRGGKMV^ 

DG42 VEELVWWRCVCIMQQWa-KREVMY^^ 

HasA LVNK G-KRHAQAHAFERSOADy- F1»TV-DSDTY rYPNALEELI^FNDETVYAATG 

ModC NV ^-KRK^IAAIGQSSGDL-VUW-DSDSTIAFDWSKIA^ 

* * * * * * * * «- ** 

MHas2 GDVQIUBCYDSWISFI^SVRYHMAF^ 

MHasl CTVRII^PIJ^SWVSFI^SUUfWAFMVER^ 

DG42 GDVRII21PYDSFISFMSSUITO1AFKVERA^ 

HasA - HLKAPKPQTNLLTRLTO IRYDMAFGVERAXQSLTGNILVCSGPLS XYRREVT I PNtERYKMQTFLGLP 

HodC ^LTASNSGDTra.TKLIDMEYTOACNEERA^ 

* * ** * ** ** 

MHas2 MFGDDRHLTNRVI^LGYArKYTARSffi 

MHaal CTFGDDRHLTNRMLSMGYATKYTSRS RCYSETPS S FIi^HI*SQQTRySKSYFREVn^fKAIJrWHRHHA 

0G42 CTI/jDDRHL11IRVI*SMGYRnCYTHKSRAFSE 

HasA VSIGDDRCLTMYAIDLG-RIVYQSTARCDTl^ 

NodC SDFGEDRHLTIIKLKAGFRTEYVPDAIVATVV^ PL — L 

* * * * * * * * 

MHas2 W MTYEAVI TGFFPFFLIATVIQIiFYRflKl- — KHIIJJTiLTVQLVGLXKSSFASCLR^rV 

MHasl W MTYEAW SGLFPFFVAATVLRIiFYAGRP — HAIXWVTXCVQGVAIAKAAFAAWLRGCVR 

DG42 K MIYESW SFIFPFFITATVIia.IYAaTI--WNVVWLIJ^ 

HasA RTI FEVVMFMMLIVAIGNIJ*FNOAJQIX)LIKI*F^ IFIVALC R --NVHYMVKHPAS 

HodC RGLS PFLAFDAVGQHI GQUXAJ*SVVrGIAHLI>frATVP^?WT ILI IA-C MTIIRCSVVALHAROLR 

* * * ** ***** 

MHas2 MVFMSLYS\tt.YMSSU^AKMFAIAT^^ 

MHasl MVIXSLYAPLYMMIXPAKFIALVTWWQSGWGTSGRK^ 

DG4 2 MIXMSLYSMLYMTCnXPSKYFAJXTLNKTGTOTS^^ 

HasA FIX£PLYGIXJiLFVLQPLKLYSLCTXKNTEftGTO KKVTIFK* 

ModC FLGFVLHTP IMLFL XLPLKAYA1CTLSMSOHLSR YSAPEVPV8 GGKQTPIQT- - 



MHas2 
MHasl 
OG42 
HasA 

ModC 



KKPFSES-KQ TVLIVGTLIYACYRVMIXTLYVVLINKCGRRKKGQQ^ OMVLDV* 

RADHSGPSRAAEAYHXAAGAaAYVAYHVVMLTrYWVGVRRX 

Qh©WSTPEKQK£MYHIJ,YGCVGyVMYWV PDM — CV* 
6GRVTPDCTCSGE*
1 acatgcaagaagaaggagaagccaaggcgtctggaaagaattacccagtcctggcttcgagcagcccattga
73 acgggggacccgaaccagccaaagacttcttcattctgctcctgctagactctgctgagccttgacccggcc 
145 tgtaggttgatgtgaaaagagattctgtgtcgtcggagggaaggggattggagcaaatagcaaaacaggggg 

III zzzzzzzz^^ 

III ssszs^^ 23 

MHCERFLCVLRIIGTTLFGVS L L 23 

577 CTC GG AATC AC AGCTGCTTATATTGTTGGC T AC C AGTTTATC C AAACAGAT AATTAC TAC TTCTC ATTTGG A 

r q t TAAY IVGYQFIQTD N Y Y F 5 F G 4 / 

649 CICTACTCTGCCrmAGCre 

TYGAFLASHLIIQSLFAFLEHRKM 71 

72X AAGAAGTCCCTO^CCCCGATTAAATTCAACAA^ 

vvSLETPIKLNKTVALClAAYQED 9b 

793 CCTG ACT ACTTACGG AAATGTTTGC AATC TGTG AAAAGGCTG AC C TAC C CTGGG A^ AAAGTCGTG ATGGTC 

PDY LRKC LQSVKRLTYPG IKvVMV 119 

865 ATCGATGGGAACTCAGACGACGACCTTTACATGATGGACATA 

IDGNSDDDLYMMDIFSEVIGRDKS143 

937 GCCACGTACATCTGGAAGAACAACTTTCATGAAAAGGGACCTGGTGAGAC 

ATYIWKNNFHEKGPGETEESHKES167 

1009 TC AC AAC ATGTC AC C C AATTGGTCTTG TC T AACAAAAGTATTTGCATC ATGC AAAAAT^ 

<z o hvtqlvlsnks ic imqkwggkk 1*1 
loai gaag^tacacacktci^ 215 

1225 GGAGATGTCCAGATTTTAAACAAGTATGATTCCTG<^TCTCCTTCCT ^ 

12 97 TTTAATATAGAAAGGGCCTGCCAGTCTTATTTTGGCTGTGTCCAGTGCATAAGC<^ 

PNTERACQSYFGCVQCISGPLG H Y 2B7 

1369 AGAAACTCCTOCTCCATGAATTTGT^^ 

RNSLLHEFVEDWYNQEFMGNQCSF311 

1441 GGTGACGACAGGCACCTTACCAACAGGGTGTTGAGTCTGGGCTATGCAACTAAATA^ 

PDDRHLTNRVLSLGYATKYTARSK 

1513 TGCCTTACTGAAACTCCCATAGAATATCTGAGATGGCTGAACCAGCAGACCCGATGGAGCAAGTCCTAC^ 
CLTETPI EYLRWLNQQTRWSKSir 

1585 CGAGAGTGGCTGTACAATGCCATGTGGTTTCACAAGCATCACCTGTGGATGACCTATGAAGCTGTTATCACT 

REWLYNAMWFHKHHLWMTYEAV I T 383 

1657 GGATTCTTTCCTTTCTTTCTCATTGCCACAGTCATCCAGCTCTTCTA ^ 

1729 CTCTTCCTGTTAACTGTCCAGCTAGTGGGTCTCATCAAGTC 

LPLLTVQLVGLIKSSFASCLRGN1 «Ji 

1801 GTCATGGTATTCATGTCTCTGTATTCAGTGTTATACATGTCAAGTCT^ 

VMVFMSLYSVLYM SSLLPAKMFAi «33 
!873 GC AAC CATAAACAAAGCTGGGTGGGGCAC ATCTGGAAGGAAG ACC ATTGTTGT^AATTTC ATAGGACTTATT 

ATINKAGWGTSGRKTlVVNFIGi-J. 

1945 CCAGTGTCCGTGTGGTTTACAATCCTTCTAGGTGGTGTAATTTO 

PVSVWFT1LLGGVIFTIYKESK K 3UJ 

2017 TTTTCCGAATCCAAACAGACTGTTCTCATCGTGGGAACTTTC 

FSESKQTVLIVGTLIYACYWVMLL527 

2089 ACTCTCTATGTGGTTCTCATCAATAAGTGTGGCAGGCGGAAGAA 

TLYVVLINKCGRRKKGQQYDMVLD551 
2161 GTA TGAtaatiatttqtagtcacacctggag flCflCflCaCflC fl Cnrnr fl r . cacarflfTflC a cacac cttagctc ^ 

2232 ctcaaggggctatacagtattgtggcaccgcaccctgccaccacaggagacacatcactgctgccgggactt 

2304 gaacaaagacattcaatgggggttggtttctettctattct^ 

2376 aagtccgattaaatctgacagttttaggacggtgggatgatgtcttggcttatgcacttttcccctactgtg 

2448 catccgcctgacagtgtttgttctaaatacctcacttgccatgctttgtgtgggtgatcatggaagaaaagg 

2520 attctgaaaactcaagggaecgttctttcaacctacacatcctaacte^^ 

2592 cttcctttctatttttcgtttttaaggaaaattgttcatctttaccaaatgaaatgccaaaggaaagttgga 

2664 aagccactggctatgctgtattttgatataataattg^ 

2736 aacaaaatttcacaccatagtctatattttacttctctggcaaaatacacttttgttcttt rftr . AtatataL 

2808 Atrafcatatat^raataaa ataqgctctaaaaaaatccatactataaaaaaaaattaaectgcccaaaacgtg 

2880 aaacgtggttgactgatgttcatgaaagflajiaaaatgtctccctctttctctacattttaaaaaaaaa 
44. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-2 with a mixture of components under 
conditions effective to yield hyaluronan. 

45. The method of claim 44 wherein the hyaluronan synthase-2 is obtained 
by the method of claim 20. 




46. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-3 with a mixture of components under 
conditions effective to yield hyaluronan. 



47. The method of claim 44 wherein the hyaluronan synthase-3 is obtained 
by the method of claim 22. 
synthase-2 DNA is indicative of the presence of the condition in 
said mammal. 

39. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA, wherein the presence or amount of hyaluronan 
synthase-3 DNA is indicative of the presence of the condition in 
said mammal. 
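
The amplification step recited in claims 36 to 39 can be illustrated with a minimal Python sketch. It assumes a toy template assembled from two rows of SEQ ID NO:31, and it borrows SEQ ID NO:34 and the 3' portion of SEQ ID NO:39 (without its 5' CCGAATTC tail) as the two oligonucleotides. The function names and the toy template are illustrative assumptions only, and nothing here asserts that these are the oligonucleotides the claims have in mind.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    table = str.maketrans("ACGT", "TGCA")
    return seq.upper().translate(table)[::-1]


def find_amplicon(template, forward, reverse):
    """Locate an exact-match primer pair on the sense strand.

    Returns (start, end, length) with 0-based, half-open coordinates,
    or None when either primer has no exact binding site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the sense strand as its reverse complement.
    site = reverse_complement(reverse)
    hit = template.find(site, start + len(forward))
    if hit == -1:
        return None
    end = hit + len(site)
    return start, end, end - start


# Toy template (an assumption for illustration): the SEQ ID NO:31 row ending
# at nucleotide 816 joined directly to the final row ending at 1665.
ROW_816 = "TCCTTCCTGAGCAGTGTGAGGTACTGGATGGCTTTCAACGTGGAGCGG"
ROW_1665 = "GAACAGTATAGCCTGGCTTTTGCGGAGGTGTGA"
TOY_TEMPLATE = ROW_816 + ROW_1665

FORWARD = "TACTGGATGGCTTTCAACGTGGAG"   # SEQ ID NO:34
REVERSE = "TCACACCTCCGCAAAAGCCAGGC"    # SEQ ID NO:39 minus its 5' CCGAATTC tail

result = find_amplicon(TOY_TEMPLATE, FORWARD, REVERSE)
print(result)
```

On a full-length cDNA the same search would report the expected product size, which is the quantity compared against the band observed in the detecting step.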

40. The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a tissue sample. 

41. The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a fluid. 

42. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-2 
activity in said mammal. 

43. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-3 
activity in said mammal. 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 



(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA. 




37. A method for detecting hyaluronan synthase-3 DNA, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA. 

38. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA, wherein the presence or amount of hyaluronan 
33. A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-3 effective to alter 
hyaluronan synthesis or extracellular accumulation. 

34. A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-2 with a mammalian sample suspected of containing 
hyaluronan synthase-2 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

35. A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-3 with a mammalian sample suspected of containing 
hyaluronan synthase-3 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

36. A method for detecting hyaluronan synthase-2 DNA, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 



26. The method of claim 24 or 25 wherein the amount of hyaluronan 
produced by the transformed host cell is increased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 



27. The method of claim 24 or 25 wherein the amount of hyaluronan 

produced by the transformed host cell is decreased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 



28. Isolated, purified hyaluronan synthase-2 polypeptide, or a biologically 
active subunit or variant thereof. 



29. The hyaluronan synthase-2 polypeptide of claim 28 having SEQ ID 
NO:2. 



30. Isolated, purified hyaluronan synthase-3 polypeptide, or a 
biologically active subunit or variant thereof. 

31. The hyaluronan synthase-3 polypeptide of claim 31 having SEQ ID 
NO:32. 



32. A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-2 effective to alter 
hyaluronan synthesis or extracellular accumulation. 
20. A method to produce hyaluronan synthase-2, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-2 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-2. 

21. The method of claim 20 further comprising isolating hyaluronan 
synthase-2 from the host cell. 



22. A method to produce hyaluronan synthase-3, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-3 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-3. 

23. The method of claim 22 further comprising isolating hyaluronan 
synthase-3 from the host cell. 



24. A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding 
hyaluronan synthase-2 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 

(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 



25. A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding 
hyaluronan synthase-3 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 
10. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a hyaluronan synthase-3 having SEQ ID NO:32. 

11. The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO:31. 




13. The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO:25. 

14. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a polypeptide comprising SEQ ID NO:29. 

15. A primer or a probe, having at least about 15 nucleotides, wherein the 
primer or probe has at least about 80% identity to the DNA molecule of 
claim 8. 
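
The length and identity thresholds in claim 15 can be illustrated with a small Python sketch. It assumes an ungapped, position-by-position comparison against a window of equal length. The claim itself does not define how identity is computed, so this definition, the helper names, and the mutated probe are all assumptions made for illustration. The reference window is taken from the SEQ ID NO:31 row ending at nucleotide 816, and the probe is SEQ ID NO:34.

```python
def percent_identity(a, b):
    """Ungapped percent identity between two equal-length sequences."""
    a = "".join(a.split()).upper()
    b = "".join(b.split()).upper()
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def passes_thresholds(probe, window, min_len=15, min_identity=80.0):
    """Hypothetical check: probe length and identity against one window."""
    probe = "".join(probe.split())
    return len(probe) >= min_len and percent_identity(probe, window) >= min_identity


# Row of SEQ ID NO:31 ending at nucleotide 816, as given in the listing.
ROW_816 = "TCC TTC CTG AGC AGT GTG AGG TAC TGG ATG GCT TTC AAC GTG GAG CGG"
WINDOW = "".join(ROW_816.split())[21:45]

SEQ_ID_34 = "TACTGGATGG CTTTCAACGT GGAG"
# Hypothetical variant with its first three bases changed (not from the source).
VARIANT = "GCATGGATGGCTTTCAACGTGGAG"

print(percent_identity(SEQ_ID_34, WINDOW))
print(percent_identity(VARIANT, WINDOW))
print(passes_thresholds(VARIANT, WINDOW))
```

A gapped alignment would be the more usual basis for an identity figure in practice; the ungapped form is used here only because it is easy to verify by hand.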

16. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-2. 

17. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-3. 

18. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-2. 

19. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-3. 
WHAT IS CLAIMED IS: 

1. An isolated and purified DNA molecule comprising a preselected DNA 
segment encoding hyaluronan synthase-2, a biologically active variant 
thereof or a biologically active subunit of the variant. 

2. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-2. 

3. The DNA molecule of claim 1 or 2 wherein the preselected DNA 
segment encodes a hyaluronan synthase-2 having SEQ ID NO:2. 

4. The DNA molecule of claim 1 wherein the preselected DNA segment 
comprises SEQ ID NO:1. 

5. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes human hyaluronan synthase-2. 

6. The DNA molecule of claim 1 or 5 wherein the preselected DNA 
segment comprises SEQ ID NO:23. 

7. An isolated and purified DNA molecule comprising SEQ ID NO:1. 

8. An isolated and purified DNA molecule comprising a preselected DNA 
segment encoding hyaluronan synthase-3, or a biologically active subunit 
or variant thereof. 

9. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-3. 
GTGCGTGGCG 
CGGAGATTCG 
ACTGCTGGAG 
TGTGCGGATC 
GGTAGCCTTC 
CGGTCCTCTA 
CCAGAAGTTC 
GCTCAGCATG 
CTCGTCCTTC 
GTGGCTGTAC 



CAGCGCTGGG 
GTGGACTACG 
CTCGTGCGGG 
CTTAACCCTC 
AATGTGGAGC 
GGCCTATATA 
CTGGGTACCC 
GGTTATGCTA 
CTGCGGTGGC 
AACGCGCTCT 



GCGGCAAGCG 
TGCAGGTCTG 
TACTGGACGA 
TGGACTCCTG 
GGGCTTGTCA 
GGAATAACCT 
ACTGTACTTT 
CCAAGTACAC 
TGAGCCAGCA 
GGTGGCACCG 



CGAGGTCATG 
TGACTCGGAC 
GGACCCCCGG 
GGTCAGCTTC 
GAGCTACTTC 
CTTGCAGCAG 
TGGGGATGAC 
CTCCAGGTCC 
GACACGCTGG 
GCACCATGCG 
:CACTGTG 



TACACAGCCT 
ACAAGGTTGG 
GTAGGGGCTG 
CTAAGCAGCC 
CACTGTGTAT 
TTTCTTGAGG 
CGGCACCTCA 
CGCTGCTACT 
TCCAAGTCGT 
TGGATGACCT 
CTGCGTCTGT 



TCAAGGCGCT 
ACCCCATGGC 
TTGGTGGGGA 
TGCGATACTG 
CCTGCATCAG 
CCTGGTACAA 
CCAACCGCAT 
CAGAGACGCC 
ACTTCCGTGA 
ACGAGGCGGT 
TCTACGCGGG 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 




GGCCTTO 
CCTCTACATG 
TGGCTGGGGC 
GGCGCTCTGG 
GGCCGACTGG 
CGCCTACGTG 
TTGCCGGCGG 
CCTCAAGGGT 
TGGGTGGTTC 
TCTTCAGTCT 
GTCAGGATGT 
TTAATCTCCA 
AAAAAAAA 



TGTGGCCTCC 
ACCTCGGGCC 
GCGCTGCTGC 
AGCGGCCCTT 
GGCTACTGGG 
CGGACCGGGG 
CTTCAGGGGA 
TCTGGGCCTC 
GGACTATATT 
GGGATTTGAG 
TTTCTACTGT 



TGCCTGCCAA 
GGCGGAAGCT 
TGCTTGGGGG 
CCCGCGCAGC 
TGGCCATGTT 
GCTACCGCGT 
GGCCAGAGGA 
AGTTTCCCTC 
GGGACTGGGA 
GAGTGGAGGG 
GTGATCAGGA 



GTTCCTGGCG 
GGCCGCTAAC 
CCTGGTCCGC 
CGAGGCCTAC 
GACGCTGTAC 
CCAGGTGTGA 
GAGCTGCTGG 
CTCTGCCAAA 
CTTCTGGGTC 
GAAGGGGTCC 
TGTAATAAAG 



CTAGTCACCA 
TACGTCCCTC 
AGCGTAGCAC 
CACTTGGCCG 
TGGGTGGGCG 
GTCCAGCCAC 
GCCCCGAGCC 
CGAGGGGGTC 
TCCAGGGAGG 
TGCTTTCTCC 
AATTTTATTT 



TG AAC C AG AG 
TGCTGCCCCT 
ACGAGGCCAG 
CGGGGGCCGG 
TGCGGAGGCT 
GCGGATGCCG 
ACGAACTTGC 
AGCCCAAGAT 
GXATTTATTG 
TCGTTCTTAT 
ATTTTCAAAA 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2108 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Asn Met Tyr Leu Ala Glu Asp Arg Ile Leu 
1               5                   10 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Asn Gln Cys Ser Phe Gly Asp Asp Arg His 
1               5                   10 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
TAGCCATCTG AGATATTCTA TAGGT 25 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GTCAGAGCTA CTTCCACTGT G 21 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AAGGAGGAGG GCGTCTCCGA G 21 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GAATTCCGGG CGCCCGGGAC TCACGCCCCT TCCTTTCCCC TCTCGCTCCC AGCAGGACGC 60 

GCCCAAGCCC ACTCCTGCAG CCCGCCGCTG CTCCGGCCTG GCCCGGAGGG TGCTGACCAT 120 

CGCCTTCGCC CTGCTCATCC TGGCCCTCAT GACCTGGGCC TACGCCGCCG GGGTGCCGCT 180 

GGCCTCCGAT CGCTACGGCC TCCTGGCCTT CGGCCTCTAC GGGGCCTTCC TTTCAGCGCA 240 

CCTGGTGGCG CAGAGCCTCT TCGCGTACCT GGAGCACCGG CGGGTGGCGG CGGCGGCGCG 300 

GGGGCCGCTG GATGCAGCCA CCGCGCGCAG TGTGGCGCTG ACCATCTCCG CCTACCAGGA 360 

GGACCCCGCG TACCTGCGCC AGTGCCTGGC GTCCGCCCGC GCCCTGCTGT ACCCGCGCGC 420 

GCGCGTGCGC GTCCTCATGG TGGTGGATGG CAACCGCGCC GAGGACCTCT ACATGGTCGA 480 

CATGTTCCGC GAGGTCTTCG CTGACGAGGA CCCCGCCACG TACGTGTGGG ACGGCAACTA 540 

CCACCAGCCC TGGGAACCCG CGGCGGCGGG CGCGGTGGGC GCCGGAGCCT ATCGGGAGGT 600 

GGAGGCGGAG GATCCTGGGC GGCTGGCAGT GGAGGCGCTG GTGAGGACTC GCAGGTGCGT 660 
        35                  40 
Trp Ala Leu Gly Ser Val Glu Ile Phe 
    50                  55 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 



45 




(ii) MC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GTGCTTCTGT CTCTCTACGC G 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAGTCCCAA TATAGTCCAG ACTG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGTGTGTTCA GTGCATTAGT GGA 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



21 



24 



23 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Pro Ser Asp Phe Gly Glu Asp Arg His Leu Thr Ile Leu Met Leu Lys 
1               5                   10                  15 

Ala Gly Phe Arg Thr Glu Tyr Val Pro Asp Ala Ile Val Ala Thr Val 
            20                  25                  30 

Val Pro Asp Thr Leu Lys Pro Tyr Leu Arg Gln Gln Leu Arg Trp Ala 
        35                  40                  45 

Arg Ser Thr Phe Arg Asp Thr 
    50                  55 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Ala Gly Ala Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr 
1               5                   10                  15 

Asn Ala Pro Phe Ile Leu Asn Leu Asp Cys Asp His Tyr Val Asn Asn 
            20                  25                  30 

Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp 
        35                  40                  45 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys 
1               5                   10                  15 

Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro Leu Arg Pro Ala Phe Lys 
            20                  25                  30 

Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu His Gln Val Leu Arg 
(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GTTGAGCCAC CGGAGGTACT TAG 




(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Val Ser Ile Gly Asp Asp Arg Cys Leu Thr Asn Tyr Ala Ile Asp 
1               5                   10                  15 

Leu Gly Arg Thr Val Tyr Gln Ser Thr Ala Arg Cys Asp Thr Asp Val 
            20                  25                  30 

Pro Phe Gln Leu Lys Ser Tyr Leu Lys Gln Gln Asn Arg Trp Asn Lys 
        35                  40                  45 

Ser Phe Phe Arg Glu Ser 
    50 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Asn Met Tyr Leu Ala Glu Asp Arg Ile Leu Cys Trp Glu Leu Val Ala 
1               5                   10                  15 

Lys Arg Asp Ala Lys Trp Val Leu Lys Tyr Val Lys Glu Ala Thr Gly 
            20                  25                  30 

Glu Thr Asp Val Pro Glu Asp Val Ser Glu Phe Ile Ser Gln Arg Arg 
        35                  40                  45 

Arg Trp Leu Asn Cys Ala Met Phe Ala Ala 
    50                  55 



(2) INFORMATION FOR SEQ ID NO: 46: 
(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

His Cys Thr Phe Gly Asp Asp Arg His Leu Thr Asn Arg Met Leu Ser 
1               5                   10                  15 

Met Gly Tyr Ala Thr Lys Tyr Thr Ser Arg Ser Arg Cys Tyr Ser Glu 
            20                  25                  30 

Thr Pro Ser Ser Phe Leu Arg Trp Leu Ser Gln Gln Thr Arg Trp Ser 
        35                  40                  45 

Lys Ser Tyr Phe Arg Glu Trp 
    50                  55 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TGTGCAGTGT AATTAGTGGG CCCT 24 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Tyr Cys Thr Leu Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 
1               5                   10                  15 

Met Gly Tyr Arg Thr Lys Tyr Thr His Lys Ser Arg Ala Phe Ser Glu 
            20                  25                  30 

Thr Pro Ser Leu Tyr Leu Arg Trp Leu Asn Gln Gln Thr Arg Trp Thr 
        35                  40                  45 

Lys Ser Tyr Phe Arg Glu Trp 
    50                  55 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 
1               5                   10                  15 

Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys Cys Leu Thr Glu 
            20                  25                  30 

Thr Pro Thr Arg Tyr Leu Arg Trp Leu Asn Gln Gln Thr Arg Trp Ser 
        35                  40                  45 

Lys Ser Tyr Phe Arg Glu Trp 
    50                  55 

(2) INFORMATION FOR SEQ ID NO: 37: 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GTCATCCAGA GGTGGTGCTT ATGG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CCGAATTCAA GATGGCGGTG CAGCTGACTA CAGCC 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCGAATTCTC ACACCTCCGC AAAAGCCAGG C 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 



24 



35 



31 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Arg Trp Leu Asn Gln Gln Thr Arg Trp 
1               5 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TACTGGATGG CTTTCAACGT GGAG 24 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser Val 
1               5                   10                  15 

Asp Tyr Ile Gln Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala Cys 
            20                  25                  30 

Thr Ile Glu Met Leu Arg Val Leu Glu Glu Asp 
        35                  40 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 
(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
                165                 170                 175 

Ala Val Val Trp Ala Ser Thr Phe Ser Cys Ile Met Gln Lys Trp Gly 
            180                 185                 190 

Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 
        195                 200                 205 

Val Asp Tyr Ile Gln Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 
    210                 215                 220 

Cys Thr Ile Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gln Val Gly 
225                 230                 235                 240 

Gly Val Gly Gly Asp Val Gln Ile Leu Asn Lys Tyr Asp Ser Trp Ile 
                245                 250                 255 

Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 
            260                 265                 270 

Ala Cys Gln Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu 
        275                 280                 285 

Gly Met Tyr Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr 
    290                 295                 300 

His Gln Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 
305                 310                 315                 320 

Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 
                325                 330                 335 

Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 
            340                 345                 350 

Asn Gln Gln Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 
        355                 360                 365 

Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 
    370                 375                 380 

Val Val Thr Gly Phe Phe Pro Phe Phe Leu Ile Ala Thr Val Ile Gln 
385                 390                 395                 400 

Leu Phe Tyr Arg Gly Arg Ile Trp Asn Ile Leu Leu Phe Leu Leu Thr 
                405                 410                 415 

Val Gln Leu Val Gly Ile Ile Lys Ala Thr Tyr Ala Cys Phe Leu Arg 
            420                 425                 430 

Gly Asn Ala Glu Met Ile Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 
        435                 440                 445 

Ser Ser Leu Leu Pro Ala Lys Ile Phe Ala Ile Ala Thr Ile Asn Lys 
    450                 455                 460 

Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr Ile Val Val Asn Phe Ile 
465                 470                 475                 480 

Gly Leu Ile Pro Val Ser Ile Trp Val Ala Val Leu Leu Gly Gly Leu 
                485                 490                 495 

Ala Tyr Thr Ala Tyr Cys Gln Asp Leu Phe Ser Glu Thr Glu Leu Ala 
            500                 505                 510 

Phe Leu Val Ser Gly Ala Ile Leu Tyr Gly Cys Tyr Trp Val Ala Leu 
        515                 520                 525 

Leu Met Leu Tyr Leu Ala Ile Ile Ala Arg Arg Cys Gly Lys Lys Pro 
    530                 535                 540 

Glu Gln Tyr Ser Leu Ala Phe Ala Glu Val 
545                 550 


(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
GGC CTA ATC CCC GTG TCC ATC TGG GTG GCA GTT CTT CTA GGG GGG TTA  1488 
Gly Leu Ile Pro Val Ser Ile Trp Val Ala Val Leu Leu Gly Gly Leu 
                485                 490                 495 

GCC TAC ACA GCT TAT TGC CAG GAC CTG TTC AGT GAG ACC GAG CTA GCC  1536 
Ala Tyr Thr Ala Tyr Cys Gln Asp Leu Phe Ser Glu Thr Glu Leu Ala 
            500                 505                 510 

TTC CTA GTC TCT GGG GCC ATC CTG TAT GGC TGC TAC TGG GTG GCC CTC  1584 
Phe Leu Val Ser Gly Ala Ile Leu Tyr Gly Cys Tyr Trp Val Ala Leu 
        515                 520                 525 

CTC ATG CTG TAT CTG GCC ATT ATT GCC CGG AGG TGT GGG AAG AAG CCA  1632 
Leu Met Leu Tyr Leu Ala Ile Ile Ala Arg Arg Cys Gly Lys Lys Pro 
    530                 535                 540 

GAA CAG TAT AGC CTG GCT TTT GCG GAG GTG TGA  1665 
Glu Gln Tyr Ser Leu Ala Phe Ala Glu Val 
545                 550 
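
As a consistency check on the listing for SEQ ID NO:31, the following Python sketch translates a row of codons with the standard genetic code and confirms the length arithmetic implied by the feature table (coding sequence 1...1662 within a 1665 base pair entry). The helper name `translate` and the use of "Ter" for a stop codon are choices made for this illustration, not notation from the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(cds):
    """Translate a DNA coding string into three-letter residue codes."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, len(seq), 3)]


# Final row of SEQ ID NO:31 as printed in the listing.
LAST_ROW = "GAA CAG TAT AGC CTG GCT TTT GCG GAG GTG TGA"
print(" ".join(translate(LAST_ROW)))

# 1662 coding nucleotides correspond to 554 codons, matching the stated
# length of SEQ ID NO:32; the remaining three nucleotides are the stop codon.
CODING_END, TOTAL_LENGTH = 1662, 1665
print(CODING_END // 3, TOTAL_LENGTH - CODING_END)
```

The translated residues agree with the amino acid line printed under the final codon row, and the trailing TGA accounts for the three nucleotides between position 1662 and the stated total of 1665.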



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 554 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Met Pro Val Gln Leu Thr Thr Ala Leu Arg Val Val Gly Thr Ser Leu 
1               5                   10                  15 

Phe Ala Leu Val Val Leu Gly Gly Ile Leu Ala Ala Tyr Val Thr Gly 
            20                  25                  30 

Tyr Gln Phe Ile His Thr Glu Lys His Tyr Leu Ser Phe Gly Leu Tyr 
        35                  40                  45 

Gly Ala Ile Leu Gly Leu His Leu Leu Ile Gln Ser Leu Phe Ala Phe 
    50                  55                  60 

Leu Glu His Arg Arg Met Arg Arg Ala Gly Arg Pro Leu Lys Leu His 
65                  70                  75                  80 

Cys Ser Gln Arg Ser Arg Ser Val Ala Leu Cys Ile Ala Ala Tyr Gln 
                85                  90                  95 

Glu Asp Pro Glu Tyr Leu Arg Lys Cys Leu Arg Ser Ala Gln Arg Ile 
            100                 105                 110 

Ala Phe Pro Asn Leu Lys Val Val Met Val Val Asp Gly Asn Arg Gln 
        115                 120                 125 

Glu Asp Thr Tyr Met Leu Asp Ile Phe His Glu Val Leu Gly Gly Thr 
    130                 135                 140 

Glu Gln Ala Gly Phe Phe Val Trp Arg Ser Asn Phe His Glu Ala Gly 
145                 150                 155                 160 

Glu Gly Glu Thr Glu Ala Ser Leu Gln Glu Gly Met Glu Arg Val Arg 
TCC TTC CTG AGC AGT GTG AGG TAC TGG ATG GCT TTC AAC GTG GAG CGG  816 
Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 
            260                 265                 270 

GCC TGC CAG TCC TAC TTT GGC TGT GTG CAA TGT ATT AGT GGG CCT TTG  864 
Ala Cys Gln Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu 
        275                 280                 285 

GGC ATG TAC CGC AAC AGC CTC CTT CAG CAG TTC CTG GAG GAT TGG TAC  912 
Gly Met Tyr Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr 
    290                 295                 300 

CA' 
His Gln Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 
305                 310                 315                 320 

CTT ACC AAC CGA GTC CTG AGT CTT GGC TAC CGG ACT AAG TAT ACA GCA  1008 
Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 
                325                 330                 335 

CGC TCT AAG TGC CTC ACA GAG ACC CCC ACT AGG TAC CTT CGA TGG CTC  1056 
Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 
            340                 345                 350 

AAT CAG CAA ACC CGC TGG AGC AAG TCT TAC TTT CGG GAA TGG CTC TAC  1104 
Asn Gln Gln Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 
        355                 360                 365 

AAT TCT CTG TGG TTC CAT AAG CAC CAC CTC TGG ATG ACC TAT GAA TCA  1152 
Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 
    370                 375                 380 

GTG GTC ACA GGT TTC TTC CCA TTC TTC CTC ATT GCT ACA GTC ATA CAA  1200 
Val Val Thr Gly Phe Phe Pro Phe Phe Leu Ile Ala Thr Val Ile Gln 
385                 390                 395                 400 

CTT TTC TAC CGT GGC CGC ATC TGG AAC ATT CTC CTC TTC CTG CTA ACA  1248 
Leu Phe Tyr Arg Gly Arg Ile Trp Asn Ile Leu Leu Phe Leu Leu Thr 
                405                 410                 415 

GTG CAG CTG GTG GGC ATT ATC AAG GCT ACC TAT GCC TGC TTC CTT CGA  1296 
Val Gln Leu Val Gly Ile Ile Lys Ala Thr Tyr Ala Cys Phe Leu Arg 
            420                 425                 430 

GGC AAT GCA GAG ATG ATC TTC ATG TCC CTC TAC TCC CTT CTC TAT ATG  1344 
Gly Asn Ala Glu Met Ile Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 
        435                 440                 445 

TCC AGC CTC TTG CCA GCC AAG ATC TTT GCT ATT GCT ACC ATC AAC AAG  1392 
Ser Ser Leu Leu Pro Ala Lys Ile Phe Ala Ile Ala Thr Ile Asn Lys 
    450                 455                 460 

TCT GGC TGG GGC ACT TCT GGC AGG AAA ACC ATT GTC GTG AAC TTC ATT  1440 
Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr Ile Val Val Asn Phe Ile 
465                 470                 475                 480 
TAC CAG TTT ATC CAC ACA GAA AAG CAC TAC CTG TCC TTT GGC CTC TAC  144 
Tyr Gln Phe Ile His Thr Glu Lys His Tyr Leu Ser Phe Gly Leu Tyr 
        35                  40                  45 

GGT GCC ATC CTG GGT CTA CAT CTG CTC ATC CAG AGC CTG TTT GCC TTC  192 
Gly Ala Ile Leu Gly Leu His Leu Leu Ile Gln Ser Leu Phe Ala Phe 
    50                  55                  60 

CTG GAG CAC CGT CGA ATG CGC AGG GCA GGG CGC CCC CTC AAG CTG CAC  240 
Leu Glu His Arg Arg Met Arg Arg Ala Gly Arg Pro Leu Lys Leu His 
65                  70                  75                  80 

TGC TCC CAG AGG TCG CGT TCA GTG GCA CTC TGC ATT GCT GCC TAC CAA  288 
Cys Ser Gln Arg Ser Arg Ser Val Ala Leu Cys Ile Ala Ala Tyr Gln 
                85                  90                  95 

GAG GAC CCC GAA TAC CTG CGC AAG TGC CTT CGC TCA GCT CAG CGC ATT  336 
Glu Asp Pro Glu Tyr Leu Arg Lys Cys Leu Arg Ser Ala Gln Arg Ile 
            100                 105                 110 

GCC TTT CCA AAC CTC AAG GTG GTC ATG GTA GTG GAT GGC AAT CGC CAG  384 
Ala Phe Pro Asn Leu Lys Val Val Met Val Val Asp Gly Asn Arg Gln 
        115                 120                 125 

GAA GAT ACC TAC ATG TTG GAC ATC TTC CAT GAG GTG CTG GGT GGC ACT  432 
Glu Asp Thr Tyr Met Leu Asp Ile Phe His Glu Val Leu Gly Gly Thr 
    130                 135                 140 

GAG CAA GCT GGC TTC TTT GTG TGG CGT AGC AAT TTC CAT GAG GCG GGT  480 
Glu Gln Ala Gly Phe Phe Val Trp Arg Ser Asn Phe His Glu Ala Gly 
145                 150                 155                 160 

GAA GGA GAG ACA GAG GCC AGC CTG CAG GAA GGC ATG GAG CGT GTG CGA  528 
Glu Gly Glu Thr Glu Ala Ser Leu Gln Glu Gly Met Glu Arg Val Arg 
                165                 170                 175 

GCT GTG GTG TGG GCC AGC ACC TTC TCA TGC ATC ATG CAG AAG TGG GGG  576 
Ala Val Val Trp Ala Ser Thr Phe Ser Cys Ile Met Gln Lys Trp Gly 
            180                 185                 190 

GGC AAG CGT GAG GTC ATG TAC ACT GCC TTC AAG GCC CTT GGC AAC TCA  624 
Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 
        195                 200                 205 

GTG GAC TAC ATC CAG GTG TGT GAC TCT GAC ACT GTG CTG GAC CCA GCC  672 
Val Asp Tyr Ile Gln Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 
    210                 215                 220 

TGC ACC ATT GAG ATG CTT CGA GTC TTG GAA GAA GAT CCC CAA GTA GGA  720 
Cys Thr Ile Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gln Val Gly 
225                 230                 235                 240 

GGT GTT GGA GGA GAT GTC CAA ATC CTC AAC AAG TAT GAT TCA TGG ATC  768 
Gly Val Gly Gly Asp Val Gln Ile Leu Asn Lys Tyr Asp Ser Trp Ile 
                245                 250                 255 
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr
1 5 10 15

Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr His Gln Lys
20 25 30

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn
35 40 45

r Lys 




Cys Leu Thr Glu Thr Pro Thr Arg [...]
65 70 75 80

Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ser Leu
85 90 95

Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser Val Val Thr
100 105 110

Gly Phe Phe Pro Phe Phe Leu Ile Ala Thr Val Ile Gln Leu Phe Tyr
115 120 125

Arg Gly Arg Ile Trp Asn Ile Leu Leu Phe Leu Leu Thr Val Gln Leu
130 135 140

Val Gly Ile Ile Lys Ala Thr Tyr Ala Cys Phe Leu Arg Gly Asn Ala
145 150 155 160

Glu Met Ile Phe Met Ser Tyr Leu Ser Leu Leu Tyr Met Ser Ser Leu
165 170 175

Leu Pro Ala Lys Ile Phe Ala Ile Ala Thr Ile Asn Lys Ser
180 185 190



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1665 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...1662

(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

ATG CCG GTG CAG CTG ACT ACA GCC CTG CGT GTG GTG GGC ACC AGT CTG 48
Met Pro Val Gln Leu Thr Thr Ala Leu Arg Val Val Gly Thr Ser Leu
1 5 10 15

TTT GCC CTG GTA GTG CTG GGA GGC ATC CTG GCG GCC TAT GTG ACA GGC 96
Phe Ala Leu Val Val Leu Gly Gly Ile Leu Ala Ala Tyr Val Thr Gly
20 25 30

1 5 10 15

Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr His Gln Lys
20 25 30

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn
35 40 45

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys
50 55 60

Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu Asn
65 70 75



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:



Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr
1 5 10 15

Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr His Gln Lys
20 25 30

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn
35 40 45

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys
50 55 60

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn Gln Gln
65 70 75 80

Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ser Leu
85 90 95

Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser Val Val Thr
100 105 110

Gly Phe Phe Pro Phe Phe Leu Ile Ala Thr Val Ile Gln Leu Phe Tyr
115 120 125

Arg Gly Arg Ile Trp Asn Ile Leu Leu Phe Leu Leu Thr Val Gln Leu
130 135 140

Val Gly Ile Ile Lys Ala Thr Tyr Ala Cys Phe Leu Arg Gly Asn Ala
145 150 155 160

Glu Met Ile Phe Met Ser Tyr Leu Ser Leu Leu Tyr Met Ser Ser Leu
165 170 175

Leu Pro Ala Lys Ile Phe Ala Ile Ala Thr Ile Asn Lys Ser
180 185 190



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
CGCGCGCTCC AAGTGCCTCA CAGAGACCCC CACTAAGTAC CTCCGGTGGC TCAAC
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GTCCTACTTT GGCTGTGTGC AATGTATTAG TGGGCCTTTG GGCATGTACC GCAACAGCCT 60

CCTTCAGCAG TTCCTGGAGG ATTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAGCTT 120

TGGGGATGAT CGGCACCTTA CCAACCGAGT CCTGAGTCTT GGCTACCGGA CTAAGTATAC 180

AGCACGCTCT AAGTGCCTCA CAGAGACCCC CACTAGGTAC CTTCGATGGC TCAAT 235

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr
1 5 10 15

Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr His Gln Lys
20 25 30

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn
35 40 45

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys
50 55 60

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GTCTTATTTT GGGTGTGTTC AGTGCATTAG TGGACCTCTG GGAATGTACA GAAACTCCTT 60

GTTGCATGAG TTTGTGGAAG ATTGGTACAA TCAAGAATTT ATGGGCAACC AATGTAGCTT 120

TGGTGATGAC AGGCATCTCA CGAACCGGGT GCTGAGCCTG GGCTATGCAA CAAAATACAC 180

AGCTCGATCT AAGTGCCTTA CTGAAACACC TATAGAATAT CTCAGATGGC TAAAC 235

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 78 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Ser Tyr Phe Gly Cys Val Gln Cys Ile Ser Gly Pro Leu Gly Met Tyr
1 5 10 15

Arg Asn Ser Leu Leu Gln Gln Phe Leu Glu Asp Trp Tyr His Gln Lys
20 25 30

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn
35 40 45

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys
50 55 60

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 235 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

GTCCTACTTT GGCTGTGTGC AGTGTATTAG TGGGCCCTTG GGCATGTACC GCAACAGCCT 60

CCTCCAGCAG TTCCTGGAGG ACTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAGCTT 120

CGGGGATGAC CGGCACCTCA CCAACCGAGT CCTGAGCCTT GGCTACCGAA CTAAGTATAC 180



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Gly Trp Gly Thr Ser Gly Arg Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CCCGGGCAAG ATGGATTGTG AGAGGTTTCT ATGTGTCCTG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CCCGGGTCAT ACATCAAGCA CCATGTCATA CTG 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Gly Asp Asp Arg His Leu Thr Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Gln Gln Thr Arg Trp Thr Lys Ser Tyr Phe
1 5 10

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GCNTTYAAYG TNGARMGNGC NTGYCA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
RTTNGTNARR TGNCKRTCRT CNCC 

(2) INFORMATION FOR SEQ ID NO: 19: 
(A) LENGTH: 41 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Lys Arg Lys Ala Gln Ile Ala Ala Ile Gly Gln Ser Ser Gly Asp Leu




Lys Leu Ala Ser Lys Met Arg Asp Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Lys Lys Lys Ile Asn Ser His Arg Trp Leu Phe Asn Ala Phe Cys Pro
1 5 10 15

Val Leu Gln Pro Thr Val Val Thr Leu Val Asp Val Gly Thr Arg Leu
20 25 30

Asn Asn Thr Ala Ile Tyr Arg Leu Trp Lys Val Phe Asp Met Asp
35 40 45

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ala Phe Asn Val Glu Arg Ala Cys Gln
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asp Ser Val
1 5 10 15

Asp Tyr Val Gln Val Cys Asp Ser Asp Thr Arg Leu Asp Pro Met Ala

20 25 30 

Leu Leu Glu Leu Val Arg Val Leu Asp Glu Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Lys Arg Glu Val Met Tyr Thr Ala Phe Gln Ala Ile Gly Thr Ser Val
1 5 10 15

Asp Tyr Val Gln Val Cys Asp Ser Asp Thr Lys Leu Asp Glu Leu Ala

20 25 30 

Thr Val Glu Met Val Lys Val Leu Glu Ser Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Lys Arg His Ala Gln Ala Trp Ala Phe Glu Arg Ser Asp Ala Asp Val
1 5 10 15

Phe Leu Thr Val Asp Ser Asp Thr Tyr Ile Tyr Pro Asn Ala Leu Glu

20 25 30 

Glu Leu Leu Lys Ser Phe Asn Asp Glu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS:
355 360 365

Thr Pro Ile Asn Leu Phe Leu Ile Leu Pro Leu Lys Ala Tyr Ala Leu
370 375 380

Cys Thr Leu Ser Asn Ser Asp Trp Leu Ser Arg Tyr Ser Ala Pro Glu
385 390 395 400

Val Pro Val Ser Gly Gly Lys Gln Thr Pro Ile Gln Thr Ser Gly Arg

405 410 415 

Val Thr Pro Asp Cys Thr Cys Ser Gly Glu 
420 425 




(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Lys Arg Glu Val Met Tyr Thr Ala Phe Arg Ala Leu Gly Arg Ser Val
1 5 10 15

Asp Tyr Val Gln Val Cys Asp Ser Asp Thr Met Leu Asp Pro Ala Ser

20 25 30 

Ser Val Glu Met Val Lys Val Leu Glu Glu Asp 
35 40 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gln Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser
1 5 10 15

Leu Gly Tyr Ala Thr Lys Tyr Thr Ala Arg Ser Lys Cys Leu Thr Glu
20 25 30

Thr Pro Ile Glu Tyr Leu Arg Trp Leu Asn Gln Gln Thr Arg Trp Ser
35 40 45

Lys Ser Tyr Phe Arg Glu Trp
50 55
45 



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid 
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